t 



Appln. No. 10/669,824 J 
..Docket.No..5.1A442001.62n' 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
25 May 2001 (25.05.2001) 




PCT 



111 



(10) International Publication Number 

WO 01/36444 Al 



(51) International Patent Classification 7 : 
C12N 5/10, 15/29, 15/63, 15/82 



C07H 21/04, 



(21) International Application Number: PCT/USOO/31325 

(22) International Filing Date: 

14 November 2000 (14.1 1.2000) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/166,228 
60/197,899 
60/227,439 



17 November 1999 (17.1 1.1999) US 
17 April 2000 (17.04.2000) US 
22 August 2000 (22.08.2000) US 



(71) Applicant (for all designated Slates except US): 
MENDEL BIOTECHNOLOGY, INC. [US/US], 21375 
Cabot Boulevard, Hay ward, CA 94541 (US). 

(71) Applicants and 

(72) Inventors: RIECHMANN, Jose Luis [ES/US]; 1 15 Moss 
Avenue #308, Oakland, CA 9461 1 (US). REUBER, Lynne 
[US/US]; 2000 Walnut Avenue, Fremont, CA 94538 (US). 
KEDDIE, James [GB/US]; 54 McLellan Avenue, San 
Mateo, CA 94403 (US). RATCLIFFE, Oliver [GB/US]; 
814 East 21st Street, Oakland, CA 94606 (US). HEARD, 
Jacqueline [US/US]; 810 Guildford Avenue, San Mateo, 
CA 94402 (US). SAMAHA, Raymond [US/US]; 2224 
Albert Lane, Capitola, CA 95010 (US). YU, Guo-Liang 



[CN/US]; 242 Gravatt Drive, Berkeley, CA 94705 (US). 
JIANG, Cai-Zhong [CN/US]; 34495 Heathrow Terrace, 
Fremont, CA 94555 (US). 

(74) Common Representative: MENDEL BIOTECHNOL- 
OGY, INC.; Guerrero, Karen, 21375 Cabot Boulevard, 
Hayward, CA 94545 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, 
HU, ID, IL, IN, IS, JP, KB, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM). European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— With international search report. 

— Before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette, 



r— I 

O (54) Title: PLANT DEVELOPMENTAL GENES 

O 



^ (57) Abstract: Recombinant polynucleotides and methods for modifying the phenotype of a plant are provided. In particular, the 



phenotype that is being modified is a plant's structure and development characteristics. 



WO 01/36444 



PCT/US00/31325 



PLANT DEVELOPMENTAL GENES 

RELATED APPLICATION INFORMATION 

The present invention claims the benefit from US Provisional Patent Application Serial 

Nos. 60/166,228 filed November 17, 1999 and 60/197,899 filed April 17, 2000 and "Plant Trait 
Modification HI" filed August 22, 2000. 

FIELD OF THE INVENTION 

This invention relates to the field of plant biology. More particularly, the present 

invention pertains to compositions and methods for phenotypically modifying a plant. 

BACKGROUND OF THE INVENTION 

Transcription factors can modulate gene expression, either increasing or 

decreasing (inducing or repressing) the rate of transcription. This modulation results in 
differential levels of gene expression at various developmental stages, in different tissues and cell 
types, and in response to different exogenous (e.g., environmental) and endogenous stimuli 
throughout the life cycle of the organism. 

Because transcription factors are key controlling elements of biological 
pathways, altering the expression levels of one or more transcription factors can change entire 
biological pathways in an organism. For example, manipulation of the levels of selected 
transcription factors may result in increased expression of economically useful proteins or 
metabolic chemicals in plants or to improve other agriculturally relevant characteristics. 
Conversely, blocked or reduced expression of a transcription factor may reduce biosynthesis of 
unwanted compounds or remove an undesirable trait. Therefore, manipulating transcription 
factor levels in a plant offers tremendous potential in agricultural biotechnology for modifying a 
plant's traits. 

The present invention provides novel transcription factors useful for modifying a 
plant's phenotype in desirable ways, such as modifying a plant's structure or development. 

SUMMARY OF THE INVENTION 

In a first aspect, the invention relates to a recombinant polynucleotide comprising 

a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding a 
polypeptide comprising a sequence selected from SEQ ID Nos. 2N, where N=l-23, or a 
complementary nucleotide sequence thereof; (b) a nucleotide sequence encoding a polypeptide 
comprising a conservatively substituted variant of a polypeptide of (a); (c) a nucleotide sequence 
comprising a sequence selected from those of SEQ ID Nos. 2N-1, where N=l-23, or a 
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complementary nucleotide sequence thereof; (d) a nucleotide sequence comprising silent 
substitutions in a nucleotide sequence of (c); (e) a nucleotide sequence which hybridizes under 
stringent conditions over substantially the entire length of a nucleotide sequence of one or more 
of: (a), (b), (c), or (d); (f) a nucleotide sequence comprising at least 15 consecutive nucleotides of 
5 a sequence of any of (a)-(e); (g) a nucleotide sequence comprising a subsequence or fragment of 
any of (a)-(f), which subsequence or fragment encodes a polypeptide having a biological activity 
that modifies a plant's structure and development characteristics; (h) a nucleotide sequence 
having at least 3 1% sequence identity to a nucleotide sequence of any of (a)-(g); (i) a nucleotide 
sequence having at least 60% identity sequence identity to a nucleotide sequence of any of (a)- 

1 0 (g)i 0) a nucleotide sequence which encodes a polypeptide having at least 31% identity sequence 
identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; (k) a nucleotide sequence which 
encodes a polypeptide having at least 60% identity sequence identity to a polypeptide of SEQ ID 
Nos. 2N, where N=l-23; and (1) a nucleotide sequence which encodes a conserved domain of a 
polypeptide having at least 65% sequence identity to a conserved domain of a polypeptide of 

1 5 SEQ ID Nos. 2N, where N=l -23. The recombinant polynucleotide may further comprise a 

constitutive, inducible, or tissue-active promoter operably linked to the nucleotide sequence. The 
invention also relates to compositions comprising at least two of the above described 
polynucleotides. 

In a second aspect, the invention is an isolated or recombinant polypeptide 
20 comprising a subsequence of at least about 1 5 contiguous amino acids encoded by the 
recombinant or isolated polynucleotide described above. 

In another aspect, the invention is a transgenic plant comprising one or more of 
the above described recombinant polynucleotides. In yet another aspect, the invention is a plant 
with altered expression levels of a polynucleotide described above or a plant with altered 
25 expression or activity levels of an above described polypeptide. Further, the invention is a plant 
lacking a nucleotide sequence encoding a polypeptide described above. The plant may be a 
soybean, wheat, corn, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, 
cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
30 pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous fruits, or 
vegetable brassicas plant. 

In a further aspect, the invention relates to a cloning or expression vector 
comprising the isolated or recombinant polynucleotide described above or cells comprising the 
cloning or expression vector. 
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In yet a further aspect, the invention relates to a composition produced by 
incubating a polynucleotide of the invention with a nuclease, a restriction enzyme, a polymerase; 
a polymerase and a primer; a cloning vector, or with a cell. 

Furthermore, the invention relates to a method for producing a plant having 
5 modified structure and development traits. The method comprises altering the expression of an 
isolated or recombinant polynucleotide of the invention or altering the expression or activity of a 
polypeptide of the invention in a plant to produce a modified plant, and selecting the modified 
plant for modified structure and development traits. 

In another aspect, the invention relates to a method of identifying a factor that is 
10 modulated by or interacts with a polypeptide encoded by a polynucleotide of the invention. The 
method comprises expressing a polypeptide encoded by the polynucleotide in a plant; and 
identifying at least one factor that is modulated by or interacts with the polypeptide. In one 
embodiment the method for identifying modulating or interacting factors is by detecting binding 
by the polypeptide to a promoter sequence, or by detecting interactions between an additional 
15 protein and the polypeptide in a yeast two hybrid system, or by detecting expression of a factor by 
hybridization to a microarray, subtractive hybridization or differential display. 

In yet another aspect, the invention is a method of identifying a molecule that 
modulates activity or expression of a polynucleotide or polypeptide of interest The method 
comprises placing the molecule in contact with a plant comprising the polynucleotide or 
20 polypeptide encoded by the polynucleotide of the invention and monitoring one or more of the 
expression level of the polynucleotide in the plant, the expression level of the polypeptide in the 
plant, and modulation of an activity of the polypeptide in the plant. 

In yet another aspect, the invention relates to an integrated system, computer or 
computer readable medium comprising one or more character strings corresponding to a 
25 polynucleotide of the invention, or to a polypeptide encoded by the polynucleotide. The 

integrated system, computer or computer readable medium may comprise a link between one or 
more sequence strings to a modified plant structure and development trait. 

In yet another aspect, the invention is a method for identifying a sequence similar 
or homologous to one or more polynucleotides of the invention, or one or more polypeptides 
30 encoded by the polynucleotides. The method comprises providing a sequence database; and, 
querying the sequence database with one or more target sequences corresponding to the one or 
more polynucleotides or to the one or more polypeptides to identify one or more sequence 
members of the database that display sequence similarity or homology to one or more of the one 
or more target sequences. 
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The method may further comprise of linking the one or more of the 
polynucleotides of the invention, or encoded polypeptides, to a modified plant structure and 
development characteristics phenotype. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Figure I provides a table of exemplary polynucleotide and polypeptide sequences of the 

invention. The table includes from left to right for each sequence: the SEQ ID No., the internal 
code reference number (GID), whether the sequence is a polynucleotide or polypeptide sequence, 
and identification of any conserved domains for the polypeptide sequences. 

Figure 2 provides a table of exemplary sequences that are homologous to other sequences 
10 provided in the Sequence Listing and that are derived from Arabidopsis thaliana. The table 
includes from left to right: the SEQ ID No., the internal code reference number (GID), 
identification of the homologous sequence, whether the sequence is a polynucleotide or 
polypeptide sequence, and identification of any conserved domains for the polypeptide 
sequences. 

15 Figure 3 provides a table of exemplary sequences that are homologous to the sequences 

provided in Figures 1 and 2 and that are derived from plants other than Arabidopsis thaliana. The 
table includes from left to right: the SEQ ID No,, the internal code reference number (GID), the 
unique GenBank sequence ID No. (NID), the probability that the comparison was generated by 
chance (P-value), and the species from which the homologous gene was identified, 

20 

DETAILED DESCRIPTION 

The present invention relates to polynucleotides and polypeptides, e.g. for 

modifying phenotypes of plants. 

In particular, the polynucleotides or polypeptides are useful for modi lying traits 

25 associated with a plant f s structure or development characteristics when the expression levels of 
the polynucleotides or expression levels or activity levels of the polypeptides are altered. 
Specifically, the polynucleotides and polypeptides are useful for modifying the structure and size 
of flowers, leaves, roots, the plant as a whole, or the like, apical dominance, branching patterns, 
number of organs, organ identity, whether a plant is sterile or not, the vascularization of a plant, 

30 or the developmental staging of a plant, such as when senescence is triggered. 

The polynucleotides of the invention encode plant transcription factors. The plant 
transcription factors are derived, e.g., from Arabidopsis thaliana and can belong, e.g., to one or 
more of the following transcription factor families: the AP2 (APETALA2) domain transcription 
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factor family (Riechmann and Meyerowitz (1998) J. Biol. Chem. 379:633-646); the MYB 
transcription factor family (Martin and Paz-Ares (1997) Trends Genet, 13:67-73); the MADS 
domain transcription factor family (Riechmann and Meyerowitz (1997) J. Biol Chem 378:1079- 
1 101); the WRKY protein family (Ishiguro and Nakamura (1994) Mol. Gen. Genet. 244:563- 
5 571); the ankyrin-repeat protein family (Zhang et al. (1992) Plant Cell 4: 1 575-1 588); the 

miscellaneous protein (MISC) family (Kim et al. (1997) Plant J. 1 1:1237-1251); the zinc finger 
protein (Z) family (Klug and Schwabe (1995) FASEB J. 9: 597-604); the homeobox (HB) protein 
family (Duboule (1994) Guidebook to the Homeobox Genes. Oxford University Press); the 
CAAT-element binding proteins (Forsburg and Guarente (1989) Genes Dev. 3: 1 166-1 178); the 

10 squamosa promoter binding proteins (SPB) (Klein et al. (1996) Mol. Gen. Genet. 1996 250:7-16); 
the NAM protein family; the IAA/AUX proteins (Rouse et al, (1998) Science 279:1371-1373); 
the HLH/MYC protein family (Littlewood et al. (1994) Prot. Profile 1 :639-709); the DNA- 
binding protein (DBP) family (Tucker et al. (1994) EMBO J. 13:2994-3002); the bZIP family of 
transcription factors (Foster et al. (1994) FASEB J. 8:192-200); the BPF-1 protein (Box P- 

15 binding factor) family (da Costa e Silva et al. (1993) Plant J. 4:125-135); and the golden protein 
(GLD) family (Hall et al. (1998) Plant Cell 10:925-936). 

In addition to methods for modifying a plant phenotype by employing one or 
more polynucleotides and polypeptides of the invention described herein, the polynucleotides 
and polypeptides of the invention have a variety of additional uses. These uses include their use 

20 in the recombinant production (i.e, expression) of proteins; as regulators of plant gene expression, 
as diagnostic probes for the presence of complementary or partially complementary nucleic acids 
(including for detection of natural coding nucleic acids); as substrates for further reactions, e.g., 
mutation reactions, PCR reactions, or the like, of as substrates for cloning e.g., including 
digestion or ligation reactions, and for identifying exogenous or endogenous modulators of the 
25 transcription factors. 

DEFINITIONS 

A "polynucleotide' 1 is a nucleic acid sequence comprising a plurality of 
polymerized nucleotide residues, e.g., at least about 15 consecutive polymerized nucleotide 
residues, optionally at least about 30 consecutive nucleotides, at least about 50 consecutive 
30 nucleotides. In many instances, a polynucleotide comprises a nucleotide sequence encoding a 
polypeptide (or protein) or a domain or fragment thereof. Additionally, the polynucleotide may 
comprise a promoter, an intron, an enhancer region, a polyadenylation site, a translation initiation 
site, 5* or 3' untranslated regions, a reporter gene, a selectable marker, or the like. The 
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polynucleotide can be single stranded or double stranded DNA or RNA. The polynucleotide 
optionally comprises modified bases or a modified backbone. The polynucleotide can be, e.g., 
genomic DNA or RNA, a transcript (such as an mRNA), a cDNA, a PCR product, a cloned DNA, 
a synthetic DNA or RNA, or the like. The polynucleotide can comprise a sequence in either 
5 sense or antisense orientations. 

A "recombinant polynucleotide" is a polynucleotide that is not in its native state, 
e.g., the polynucleotide comprises a nucleotide sequence not found in nature, or the 
polynucleotide is in a context other than that in which it is naturally found, e.g., separated from 
nucleotide sequences with which it typically is in proximity in nature, or adjacent (or contiguous 
10 with) nucleotide sequences with which it typically is not in proximity. For example, the sequence 
at issue can be cloned into a vector, or otherwise recombined with one or more additional nucleic 
acid. 

An "isolated polynucleotide" is a polynucleotide whether naturally occurring or 
recombinant, that is present outside the cell in which it is typically found in nature, whether 

1 5 purified or not. Optionally, an isolated polynucleotide is subject to one or more enrichment or 
purification procedures, e.g., cell lysis, extraction, centrifugation, precipitation, or the like. 

A "recombinant polypeptide" is a polypeptide produced by translation of a 
recombinant polynucleotide. An "isolated polypeptide," whether a naturally occurring or a 
recombinant polypeptide, is more enriched in (or out of) a cell than the polypeptide in its natural 

20 state in a wild type cell, e.g., more than about 5% enriched, more than about 10% enriched, or 
more than about 20%, or more than about 50%, or more, enriched, i.e., alternatively denoted: 
105%, 1 10%, 120%, 150% or more, enriched relative to wild type standardized at 100%. Such an 
enrichment is not the result of a natural response of a wild type plant. Alternatively, or 
additionally, the isolated polypeptide is separated from other cellular components with which it is 

25 typically associated, e.g., by any of the various protein purification methods herein. 

The term "transgenic plant" refers to a plant that contains genetic material, not 
found in a wild type plant of the same species, variety or cultivar. The genetic material may 
include a transgene, an insertional mutagenesis event (such as by transposon or T-DNA 
insertional mutagenesis), an activation tagging sequence, a mutated sequence, a homologous 

30 recombination event or a sequence modified by chimeraplasty. Typically, the foreign genetic 
material has been introduced into the plant by human manipulation. 

A transgenic plant may contain an expression vector or cassette. The expression 
cassette typically comprises a polypeptide-encoding sequence operably linked (i.e., under 
regulatory control of) to appropriate inducible or constitutive regulatory sequences that allow for 
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the expression of polypeptide. The expression cassette can be introduced into a plant by 
transformation or by breeding after transformation of a parent plant. A plant refers to a whole 
plant as well as to a plant part, such as seed, fruit, leaf, or root, plant tissue, plant cells or any 
other plant material, e.g., a plant explant, as well as to progeny thereof, and to in vitro systems 
5 that mimic biochemical or cellular components or processes in a cell. 

The phrase "ectopically expression or altered expression*' in reference to a 
polynucleotide indicates that the pattern of expression in, e.g., a transgenic plant or plant tissue, is 
different from the expression pattern in a wild type plant or a reference plant of the same species. 
For example, the polynucleotide or polypeptide is expressed in a cell or tissue type other than a 

1 0 cell or tissue type in which the sequence is expressed in the wild type plant, or by expression at a 
time other than at the time the sequence is expressed in the wild type plant, or by a response to 
different inducible agents, such as hormones or environmental signals, or at different expression 
levels (either higher or lower) compared with those found in a wild type plant. The term also 
refers to altered expression patterns that are produced by lowering the levels of expression to 

1 5 below the detection level or completely abolishing expression. The resulting expression pattern 
can be transient or stable, constitutive or inducible. In reference to a polypeptide, the term 
"ectopic expression or altered expression" further may relate to altered activity levels resulting 
from the interactions of the polypeptides with exogenous or endogenous modulators or from 
interactions with factors or as a result of the chemical modification of the polypeptides. 

20 The term "fragment" or "domain," with respect to a polypeptide, refers to a 

subsequence of the polypeptide. In some cases, the fragment or domain, is a subsequence of the 
polypeptide which performs at least one biological function of the intact polypeptide in 
substantially the same manner, or to a similar extent, as does the intact polypeptide. For example, 
a polypeptide fragment can comprise a recognizable structural motif or functional domain such as 

25 a DNA binding domain that binds to a DNA promoter region, an activation domain or a domain 
for protein-protein interactions. Fragments can vary in size from as few as 6 amino acids to the 
full length of the intact polypeptide, but are preferably at least about 30 amino acids in length and 
more preferably at least about 60 amino acids in length. In reference to a nucleotide sequence, "a 
fragment" refers to any subsequence of a polynucleotide, typically, of at least consecutive about 

30 1 5 nucleotides, preferably at least about 30 nucleotides, more preferably at least about 50, of any 
of the sequences provided herein. 

The term "trait" refers to a physiological, morphological, biochemical or physical 
characteristic of a plant or particular plant material or cell. In some instances, this characteristic 
is visible to the human eye, such as seed or plant size, or can be measured by available 
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biochemical techniques, such as the protein, starch or oil content of seed or leaves or by the 
observation of the expression level of genes, e.g., by employing Northern analysis, RT-PCR, 
microarray gene expression assays or reporter gene expression systems, or by agricultural 
observations such as stress tolerance, yield or pathogen tolerance. 
5 "Trait modification" refers to a detectable difference in a characteristic in a plant 

ectopically expressing a polynucleotide or polypeptide of the present invention relative to a plant 
not doing so, such as a wild type plant. In some cases, the trait modification can be evaluated 
quantitatively. For example, the trait modification can entail at least about a 2% increase or 
decrease in an observed trait (difference), at least a 5% difference, at least about a 10% 

10 difference, at least about a 20% difference, at least about a 30%, at least about a 50%, at least 

about a 70%, or at least about a 100%, or an even greater difference. It is known that there can be 
a natural variation in the modified trait. Therefore, the trait modification observed entails a 
change of the normal distribution of the trait in the plants compared with the distribution 
observed in wild type plant. 

1 5 Trait modifications of particular interest include those to seed ( such as embryo 

or endosperm), fruit, root, flower, leaf, stem, shoot, seedling or the like, including: enhanced 
tolerance to environmental conditions including freezing, chilling, heat, drought, water saturation, 
radiation and ozone; improved tolerance to microbial, fungal or viral diseases; improved 
tolerance to pest infestations, including nematodes, mollicutes, parasitic higher plants or the like; 

20 decreased herbicide sensitivity; improved tolerance of heavy metals or enhanced ability to take up 
heavy metals; improved growth under poor photoconditions (e.g., low light and/or short day 
length), or changes in expression levels of genes of interest. Other phenotype that can be 
modified relate to the production of plant metabolites, such as variations in the production of 
taxol, tocopherol, tocotrienol, sterols, phytosterols, vitamins, wax monomers, anti-oxidants, 

25 amino acids, lignins, cellulose, tannins, prenyllipids (such as chlorophylls and carotenoids), 
glucosinolates, and terpenoids, enhanced or compositionally altered protein or oil production 
(especially in seeds), or modified sugar (insoluble or soluble) and/or starch composition. 
Physical plant characteristics that can be modified include cell development (such as the number 
of trichomes), fruit and seed size and number, yields of plant parts such as stems, leaves and 

30 roots, the stability of the seeds during storage, characteristics of the seed pod (e.g., susceptibility 
to shattering), root hair length and quantity, internode distances, or the quality of seed coat. Plant 
growth characteristics that can be modified include growth rate, germination rate of seeds, vigor 
of plants and seedlings, leaf and flower senescence, male sterility, apomixis, flowering time, 
flower abscission, rate of nitrogen uptake, biomass or transpiration characteristics, as well as 
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plant architecture characteristics such as apical dominance, branching patterns, number of organs, 
organ identity, organ shape or size. 

POLYPEPTIDES AND POLYNUCLEOTIDES OF THE INVENTION 

The present invention provides, among other things, transcription factors (TFs), 
5 and transcription factor homologue polypeptides, and isolated or recombinant polynucleotides 
encoding the polypeptides. These polypeptides and polynucleotides may be employed to modify 
a plant's structure and development characteristics. 

Exemplary polynucleotides encoding the polypeptides of the invention were 
identified in the Arabidopsis thaliana GenBank database using publicly available sequence 
10 analysis programs and parameters. Sequences initially identified were then further characterized 
to identify sequences comprising specified sequence strings corresponding to sequence motifs 
present in families of known transcription factors. Polynucleotide sequences meeting such 
criteria were confirmed as transcription factors. 

Additional polynucleotides of the invention were identified by screening 
15 Arabidopsis thaliana and/or other plant cDNA libraries with probes corresponding to known 
transcription factors under low stringency hybridization conditions. Additional sequences, 
including full length coding sequences were subsequently recovered by the rapid amplification of 
cDNA ends (RACE) procedure, using a commercially available kit according to the 
manufacturer's instructions. Where necessary, multiple rounds of RACE are performed to isolate 
20 5' and 3' ends. The full length cDNA was then recovered by a routine end-to-end polymerase 
chain reaction (PCR) using primers specific to the isolated 5' and 3' ends. Exemplary sequences 
are provided in the Sequence Listing. 

The polynucleotides of the invention were ectopically expressed in overexpressor 
or knockout plants and changes in the structure and development characteristics of the plants 
25 were observed. Therefore, the polynucleotides and polypeptides can be employed to improve the 
structure and development characteristics of plants. 

Makine polynucleotides 

The polynucleotides of the invention include sequences that encode transcription 
factors and transcription factor homologue polypeptides and sequences complementary thereto, as 
30 well as unique fragments of coding sequence, or sequence complementary thereto. Such 

polynucleotides can be, e.g., DNA or RNA, e.g., mRNA, cRNA, synthetic RNA, genomic DNA, 
cDNA synthetic DNA, oligonucleotides, etc. The polynucleotides are either double-stranded or 
single-stranded, and include either, or both sense (i.e., coding) sequences and antisense (i.e., non- 
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coding, complementary) sequences. The polynucleotides include the coding sequence of a 
transcription factor, or transcription factor homologue polypeptide, in isolation, in combination 
with additional coding sequences (e.g., a purification tag, a localization signal, as a fusion- 
protein, as a pre-protein, or the like), in combination with non-coding sequences (e.g., introns or 
5 inteins, regulatory elements such as promoters, enhancers, terminators, and the like), and/or in a 
vector or host environment in which the polynucleotide encoding a transcription factor or 
transcription factor homologue polypeptide is an endogenous or exogenous gene. 

A variety of methods exist for producing the polynucleotides of the invention. 
Procedures for identifying and isolating DNA clones are well known to those of skill in the art, 

10 and are described in, e.g., Berger and Kimmel, Guide to Molecular Cloning Techniques. Methods 
in Enzvmology volume 152 Academic Press, Inc., San Diego, CA ("Berger"); Sambrook et al., 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York, 1989 ("Sambrook") and Current Protocols in Molecular Biology. 
F.M. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing 

15 Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 2000) ("Ausubel"). 

Alternatively, polynucleotides of the invention, can be produced by a variety of 
in vitro amplification methods adapted to the present invention by appropriate selection of 
specific or degenerate primers. Examples of protocols sufficient to direct persons of skill through 
in vitro amplification methods, including the polymerase chain reaction (PCR) the ligase chain 

20 reaction (LCR), Qbeta-replicase amplification and other RNA polymerase mediated techniques 
(e.g., NASBA), e.g., for the production of the homologous nucleic acids of the invention are 
found in Berger, Sambrook, and Ausubel, as well as Mullis et al., (1987) PCR Protocols A Guide 
to Methods and Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis). 
Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., 

25 U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are 

summarized in Cheng et al. (1994) Nature 369: 684-685 and the references cited therein, in which 
PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any 
RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR 
expansion and sequencing using reverse transcriptase and a polymerase. See y e.g., Ausubel, 

30 Sambrook and Berger, all supra. 

Alternatively, polynucleotides and oligonucleotides of the invention can be 
assembled from fragments produced by solid-phase synthesis methods. Typically, fragments of 
up to approximately 100 bases are individually synthesized and then enzymatically or chemically 
ligated to produce a desired sequence, e.g., a polynucleotide encoding all or part of a 
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transcription factor. For example, chemical synthesis using the phosphoramidite method is 
described, e.g., by Beaucage et al. (1981) Tetrahedron Letters 22:1859-69; and Matthes et al. 
(1984) EMBO J. 3:801-5. According to such methods, oligonucleotides are synthesized, purified, 
annealed to their complementary strand, ligated and then optionally cloned into suitable vectors. 
5 And if so desired, the polynucleotides and polypeptides of the invention can be custom ordered 
from any of a number of commercial suppliers. 

HOMOLOGOUS SEQUENCES 

Sequences homologous, i.e., that share significant sequence identity or similarity, 
to those provided in the Sequence Listing, derived from Arabidopsis thaliana or from other plants 

10 of choice are also an aspect of the invention. Homologous sequences can be derived from any 
plant including monocots and dicots and in particular agriculturally important plant species, 
including but not limited to, crops such as soybean, wheat, corn, potato, cotton, rice, oilseed rape 
(including canola), sunflower, alfalfa, sugarcane and turf; or fruits and vegetables, such as 
banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, 

15 cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
pineapple, spinach, squash, sweet com, tobacco, tomato, watermelon, rosaceous fruits (such as 
apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, 
cauliflower, brussel sprouts and kohlrabi). Other crops, fruits and vegetables whose phenotype 
can be changed include barley, rye, millet, sorghum, currant, avocado, citrus fruits such as 

20 oranges, lemons, grapefruit and tangerines, artichoke, cherries, nuts such as the walnut and 
peanut, endive, leek, roots, such as arrowroot, beet, cassava, turnip, radish, yam, and sweet 
potato, and beans. The homologous sequences may also be derived from woody species, such 
pine, poplar and eucalyptus. 

Transcription factors that are homologous to the listed sequences will typically 

25 share at least about 30% amino acid sequence identity. More closely related transcription factors 
can share at least about 50%, about 60%, about 65%, about 70%, about 75% or about 80% or 
about 90% or about 95% or about 98% or more sequence identity with the listed sequences. 
Factors that are most closely related to the listed sequences share, e.g., at least about 85%, about 
90% or about 95% or more % sequence identity to the listed sequences. At the nucleotide level, 

30 the sequences will typically share at least about 40% nucleotide sequence identity, preferably at 
least about 50%, about 60%, about 70% or about 80% sequence identity, and more preferably 
about 85%, about 90%, about 95% or about 97% or more sequence identity to one or more of the 
listed sequences. The degeneracy of the genetic code enables major variations in the nucleotide 
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sequence of a polynucleotide while maintaining the amino acid sequence of the encoded protein. 
Conserved domains within a transcription factor family may exhibit a higher degree of sequence 
homology, such as at least 65% sequence identity including conservative substitutions, and 
preferably at least 80% sequence identity. 

5 Identifying Nucleic Acids bv Hybridization 

Polynucleotides homologous to the sequences illustrated in the Sequence Listing 

can be identified, e.g., by hybridization to each other under stringent or under highly stringent 

conditions. Single stranded polynucleotides hybridize when they associate based on a variety of 

well characterized physico-chemical forces, such as hydrogen bonding, solvent exclusion, base 

10 stacking and the like. The stringency of a hybridization reflects the degree of sequence identity 
of the nucleic acids involved, such that the higher the stringency, the more similar are the two 
polynucleotide strands. Stringency is influenced by a variety of factors, including temperature, 
salt concentration and composition, organic and non-organic additives, solvents, etc. present in 
both the hybridization and wash solutions and incubations (and number), as described in more 

1 5 detail in the references cited above. 

An example of stringent hybridization conditions for hybridization of 
complementary nucleic acids which have more than 100 complementary residues on a filter in a 
Southern or northern blot is about 5°C to 20°C lower than the thermal melting point (Tm) for the 
specific sequence at a defined ionic strength and pH. The T m is the temperature (under defined 

20 ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched 
probe. Nucleic acid molecules that hybridize under stringent conditions will typically hybridize 
to a probe based on either the entire cDNA or selected portions, e.g., to a unique subsequence, of 
the cDNA under wash conditions of 0.2x SSC to 2.0 x SSC, 0.1% SDS at 50-65° C, for example 
0.2 x SSC, 0.1% SDS at 65° C. For identification of less closely related homologues washes can 

25 be performed at a lower temperature, e.g., 50° C. In general, stringency is increased by raising 
the wash temperature and/or decreasing the concentration of SSC. 

As another example, stringent conditions can be selected such that an 
oligonucleotide that is perfectly complementary to the coding oligonucleotide hybridizes to the 
coding oligonucleotide with at least about a 5-10x higher signal to noise ratio than the ratio for 

30 hybridization of the perfectly complementary oligonucleotide to a nucleic acid encoding a 
transcription factor known as of the filing date of the application. Conditions can be selected 
such that a higher signal to noise ratio is observed in the particular assay which is used, e.g., 
about 15x, 25x, 35x, 50x or more. Accordingly, the subject nucleic acid hybridizes to the unique 
coding oligonucleotide with at least a 2x higher signal to noise ratio as compared to hybridization 
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of the coding oligonucleotide to a nucleic acid encoding known polypeptide. Again, higher 
signal to noise ratios can be selected, e.g., about 5x, lOx, 25x, 35x, 50x or more. The particular 
signal will depend on the label used in the relevant assay, e.g., a fluorescent label, a colorimetric 
label, a radio active label, or the like. 
5 Alternatively, transcription factor homologue polypeptides can be obtained by 

screening an expression library using antibodies specific for one or more transcription factors. 
With the provision herein of the disclosed transcription factor, and transcription factor homologue 
nucleic acid sequences, the encoded polypeptide(s) can be expressed and purified in a 
heterologous expression system (e.g., E. coli) and used to raise antibodies (monoclonal or 

1 0 polyclonal) specific for the polypeptide(s) in question. Antibodies can also be raised against 
synthetic peptides derived from transcription factor, or transcription factor homologue, amino 
acid sequences. Methods of raising antibodies are well known in the art and are described in 
Harlow and Lane (1988) Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory, New 
York. Such antibodies can then be used to screen an expression library produced from the plant 

1 5 from which it is desired to clone additional transcription factor homologues, using the methods 
described above. The selected cDNAs can be confirmed by sequencing and enzymatic activity. 

SF QTFRNCE VARIATIONS 

It will readily be appreciated by those of skill in the art, that any of a variety of 

polynucleotide sequences are capable of encoding the transcription factors and transcription 
20 factor homologue polypeptides of the invention. Due to the degeneracy of the genetic code, 

many different polynucleotides can encode identical and/or substantially similar polypeptides in 

addition to those sequences illustrated in the Sequence Listing. 

For example, Table 1 illustrates, e.g., that the codons AGC, AGT, TCA, TCC, 

TCG, and TCT all encode the same amino acid: serine. Accordingly, at each position in the 
25 sequence where there is a codon encoding serine, any of the above trinucleotide sequences can be 

used without altering the encoded polypeptide. 
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Table 1 



Amino acids 


Codon 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


TGC 


TGT 










Aspartic acid 


Asp 


D 


GAC 


GAT 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


TTC 


TTT 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGT 






Histidine 


His 


H 


CAC 


CAT 










Isoleucine 


lie 


I 


ATA 


ATC 


ATT 








Lysine 


Lys 


K 


AAA 


A A 

AAG 








pTT 


Leucine 


Leu 


L 


TTA 


TTG 








Methionine 


Met 


M 


ATG 












Asparagine 


Asn 


N 


AAC 


AAT 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


CCT 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGT 


Serine 


Ser 


S 


AGC 


AGT 


TCA 


TCC 


TCG 


TCT 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACT 






Valine 


Val 


V 


GTA 


GTC 


GTG 


GTT 






Tryptophan 


Trp 


W 


TGG 












Tyrosine 


Tyr 


Y 


TAC 


TAT 











Sequence alterations that do not change the amino acid sequence encoded by the 
5 polynucleotide are termed "silent" variations. With the exception of the codons ATG and TGG, 
encoding methionine and tryptophan, respectively, any of the possible codons for the same amino 
acid can be substituted by a variety of techniques, e.g., site-directed mutagenesis, available in the 
art. Accordingly, any and all such variations of a sequence selected from the above table are a 
feature of the invention. 

10 In addition to silent variations, other conservative variations that alter one, or a 

few amino acids in the encoded polypeptide, can be made without altering the function of the 
polypeptide, these conservative variants are, likewise, a feature of the invention. 

For example, substitutions, deletions and insertions introduced into the sequences 
provided in the Sequence Listing are also envisioned by the invention. Such sequence 

1 5 modifications can be engineered into a sequence by site-directed mutagenesis (Wu (ed.) Meth, 
Enzvmol . (1993) vol. 217, Academic Press) or the other methods noted below. Amino acid 
substitutions are typically of single residues; insertions usually will be on the order of about from 
1 to 10 amino acid residues; and deletions will range about from I to 30 residues. In preferred 
embodiments, deletions or insertions are made in adjacent pairs, e.g., a deletion of two residues or 

20 insertion of two residues. Substitutions, deletions, insertions or any combination thereof can be 
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combined to airive at a sequence. The mutations that are made in the polynucleotide encoding the 
transcription factor should not place the sequence out of reading frame and should not create 
complementary regions that could produce secondary mRNA structure. Preferably, the 
polypeptide encoded by the DNA performs the desired function. 

Conservative substitutions are those in which at least one residue in the amino 
acid sequence has been removed and a different residue inserted in its place. Such substitutions 
generally are made in accordance with the Table 2 when it is desired to maintain the activity of 
the protein. Table 2 shows amino acids which can be substituted for an amino acid in a protein 
and which are typically regarded as conservative substitutions. 

Table 2 



Residue 


Conservative oudsuiuuuh» 


Ala 


OCT 


Arg 


Lys 


Asn 


Gin- His 

vjiii, ilia 


Asp 


Gin 

VJLU 


Gin 


Asn 


Cys 


Ser 


Glu 


Asp 


Gly 


Pro 


His 


Asn; Gin 


lie 


Leu, Val 


Leu 


He; Val 


Lys 


Arg; Gin 


Met 


Leu; He 


Phe 


Met; Leu; Tyr 


Ser 


Thr; Gly 


Thr 


Ser;Val 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Substitutions that are less conservative than those in Table 2 can be selected by 
picking residues that differ more significantly in their effect on maintaining (a) the structure of 
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the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of 
the side chain. The substitutions which in general are expected to produce the greatest changes in 
protein properties will be those in which (a) a hydrophilic residue, e.g., seryl or threonyl, is 
5 substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or alanyl; 
(b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. 

10 FURTHER MODIFYING SEQUENCES OF THE INVENTION — MUTA TION/ FORCED 
EVOLUTION 

In addition to generating silent or conservative substitutions as noted, above, the 
present invention optionally includes methods of modifying the sequences of the Sequence 
Listing. In the methods, nucleic acid or protein modification methods are used to alter the given 

15 sequences to produce new sequences and/or to chemically or enzymatically modify given 
sequences to change the properties of the nucleic acids or proteins. 

Thus, in one embodiment, given nucleic acid sequences are modified, e.g., 
according to standard mutagenesis or artificial evolution methods to produce modified sequences. 
For example, Ausubel, supra, provides additional details on mutagenesis methods. Artificial 

20 forced evolution methods are described, e.g., by Stemmer (1994) Nature 370:389-391, and 
Stemmer (1994) Proc. Natl. Acad. Sci. USA 91 : 10747-10751 . Many other mutation and 
evolution methods are also available and expected to be within the skill of the practitioner. 

Similarly, chemical or enzymatic alteration of expressed nucleic acids and 
polypeptides can be performed by standard methods. For example, sequence can be modified by 

25 addition of lipids, sugars, peptides, organic or inorganic compounds, by the inclusion of modified 
nucleotides or amino acids, or the like. For example, protein modification techniques are 
illustrated in Ausubel, supra. Further details on chemical and enzymatic modifications can be 
found herein. These modification methods can be used to modify any given sequence, or to 
modify any sequence produced by the various mutation and artificial evolution modification 

3 0 methods noted herein. 

Accordingly, the invention provides for modification of any given nucleic acid 
by mutation, evolution, chemical or enzymatic modification, or other available methods, as well 
as for the products produced by practicing such methods, e.g., using the sequences herein as a 
starting substrate for the various modification approaches. 
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For example, optimized coding sequence containing codons preferred by a 
particular prokaryotic or eukaryotic host can be used e.g., to increase the rate of translation or to 
produce recombinant RNA transcripts having desirable properties, such as a longer half-hfe, as 
compared with transcripts produced using a non-optimized sequence. Translation stop codons 
can also be modified to reflect host preference. For example, preferred stop codons for S. 
cerevisiae and mammals are TAA and TGA, respectively. The preferred stop codon for 
monocotyledonous plants is TGA, whereas insects and E. coli prefer to use TAA as the stop 

codon. . 

The polynucleotide sequences of the present invention can also be engineered in 

order to alter a coding sequence for a variety of reasons, including but not limited to, alterations 
which modify the sequence to facilitate cloning, processing and/or expression of the gene, 
product. For example, alterations are optionally introduced using techniques which are well 
known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter 
glycosylate patterns, to change codon preference, to introduce splice sites, etc. 

Furthermore, a fragment or domain derived from any of the polypeptides of the 
invention can be combined with domains derived from other transcription factors or synthetic 
domains to modify the biological activity of a transcription factor. For instance, a DNA binding 
domain derived from a transcription factor of the invention can be combined with the activation 
domain of another transcription factor or with a synthetic activation domain. A transcription 
activation domain assists in initiating transcription from a DNA binding site. Examples include 
the transcription activation region of VP16 or GAL4 (Moore et al. (1998) Proc Natl. Acad. Sci. 
USA 95- 376-381; and Aoyama et al. (1995) PlantCell 7:1773-1785), peptides derived from 
bacterial sequences (Ma and Ptashne (1987) Cell 51; 1 13-1 19) and synthetic peptides (Giniger 
and Ptashne, (1987) Nature 330:670-672). 

EXPRESSION AND MODIFIC ATION OF POLYPEPTIDES, 

Typically, polynucleotide sequences of the invention are incorporated into 

recombinant DNA (or RNA) molecules that direct expression of polypeptides of the invention in 
appropriate host cells, transgenic plants, in vitro translation systems, or the like. Due to the 
inherent degeneracy of the genetic code, nucleic acid sequences which encode substantially the 
same or a functionally equivalent amino acid sequence can be substituted for any listed sequence 
to provide for cloning and expressing the relevant homologue. 
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Vectors. Promoters and Expression Systems 

The present invention includes recombinant constructs comprising one or more 
of the nucleic acid sequences herein. The constructs typically comprise a vector, such as a 
plasmid, a cosmid, a phage, a virus (e.g., a plant virus), a bacterial artificial chromosome (BAC), 
5 a yeast artificial chromosome (YAC), or the like, into which a nucleic acid sequence of the 
invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are 
known to those of skill in the art, and are commercially available. 

1 0 General texts which describe molecular biological techniques useful herein, 

including the use and production of vectors, promoters and many other relevant topics, include 
Berger, Sambrook and Ausubel, supra. Any of the identified sequences can be incorporated into a 
cassette or vector, e.g., for expression in plants. A number of expression vectors suitable for stable 
transformation of plant cells or for the establishment of transgenic plants have been described 

1 5 including those described in Weissbach and Weissbach, (1 989; Methods for Plant Molecular 
Biology , Academic Press, and Gelvin et al., (1990) Plant Molecu lar Biology Manual Kluwer 
Academic Publishers. Specific examples include those derived from a Ti plasmid of 
Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella et al. (1983) Nature 
303: 209, Bevan (1984) Nucl Acid Res. 12: 871 1-8721, Klee (1985) Bio/Technology 3: 637-642, 

20 for dicotyledonous plants. 

Alternatively, non-Ti vectors can be used to transfer the DNA into 
monocotyledonous plants and cells by using free DNA delivery techniques. Such methods can 
involve, for example, the use of liposomes, electroporation, microprojectile bombardment, silicon 
carbide whiskers, and viruses. By using these methods transgenic plants such as wheat, rice 

25 (Christou (1991) Bio/Technology 9: 957-962) and corn (Gordon-Kamm (1990) Plant Cell 2: 603- 
61 8) can be produced. An immature embryo can also be a good target tissue for monocots for 
direct DNA delivery techniques by using the particle gun (Weeks et al. (1993) Plant Physiol 102: 
1077-1084; Vasil (1993) Bio/Technology 10: 667-674; Wan and Lemeaux (1994) Plant Physiol 
104: 37-48, and for Agrobacterium-mediated DNA transfer (Ishida et al. (1996) Nature Biotech 

30 14: 745-750). 

Typically, plant transformation vectors include one or more cloned plant coding 
sequence (genomic or cDNA) under the transcriptional control of 5' and 3 1 regulatory sequences 
and a dominant selectable marker. Such plant transformation vectors typically also contain a 
promoter (e.g., a regulatory region controlling inducible or constitutive, environmentally-or 



18 



PCT7US00/31325 

WO 01/36444 

developmentally-regulated, or cell- or tissue-specific expression), a transcription initiation start 
site, an RNA processing signal (such as intron splice sites), a transcription termination site, and/or 

a polyadenylation signal. 

Examples of constitutive plant promoters which can be useful for expressing the 
5 TF sequence include: the cauliflower mosaic virus (CaMV) 35S promoter, which confers 
constitutive, high-level expression in most plant tissues e.g., Odel et al. (1985) Nature 
313:810); the nopaline synthase promoter (An et al. (1988) PlantJM 88:547); and the 
octopine synthase promoter (Fromm et al. (1989) PJantCeJ! 1: 977). 

A variety of plant gene promoters that regulate gene expression in response to 
10 environmental, hormonal, chemical, developmental signals, and in a tissue-active manner can be 
used for expression of a TF sequence in plants. Choice of a promoter is based largely on the 
phenotype of interest and is determined by such factors as tissue (e.g., seed, fruit, root, pollen, 
vascular tissue, flower, carpel, etc.), inducibility (e.g., in response to wounding, heat, cold, 
drought, light, pathogens, etc.), timing, developmental stage, and the like. Numerous known 
1 5 promoters have been characterized and can favorable be employed to promote expression of a 
polynucleotide of the invention in a transgenic plant or cell of interest. For example, tissue 
specific promoters include: seed-specific promoters (such as the napin, phaseolin or DC3 
promoter described in US Pat. No. 5,773,697), fruit-specific promoters that are active during fruit 
ripening (such as the dru 1 promoter (US Pat. No. 5,783,393), or the 2A1 1 promoter (US Pat. No. 
20 4 943,674) and the tomato polygalacturonase promoter (Bird et al. (1988) PJantMoi£iol 1 1:651), 
root-specific promoters, such as those d.sclosed in US Patent Nos. 5,61 8,988, 5,837,848 and 
5 905 186, pollen-active promoters such as PTA29, PTA26 and PTA13 (US Pat. No. 5,792,929), 
promoters active in vascular tissue (Ringli and Keller (1998) Plant Mol Bio l 37:977-988), flower- 
specific (Kaiser et al, (1995) Plant Mol Biol 28:231-243), pollen (Baerson et al. (1994) PlantMol 
25 Biol 26: 1947-1959). carpels (Ohl et al. (1990) Plant Cell 2:837-848), pollen and ovules (Baerson 
et al. (1993) Plant Mol Biol 22:255-267), auxin-inducible promoters (such as that described in 
van der Kop et al. (1999) Plant Mol Biol 39:979-990 or Baumann et al. (1999) PJantCeJl 1 1:323- 
334) cytokinin-inducible promoter (Guevara-Garcia (1998) PlaritMoiBiol 38:743-753), 
promoters responsive to gibbere.lin (Shi et al. (1998) Plant Mol Biol 38:1053-1060, Wfflmott et 
30 al. (1998) 38:817-825) and the like. Additional promoters are those that elicit expression in 
response to heat (Ainley et al. (1993) Plant Mol Biol 22: 13-23), light (e.g., the pea rbcS-3A 
promoter, Kuhlemeier et al. (1989) Plant Cell 1:471, and the maize rbcS promoter, Schaffher and 
Sheen (1991) Plant Cell 3: 997); wounding (e.g., wunl, Siebertz et al. (1989) PiantCell 1: 961); 
pathogens (such as the PR-1 Fomoter described in Buchel et al. (1999) Plant Mol. Biol. 40:387- 
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396, and the PDF1.2 promoter described in Manners et at. (1998) Plant Mol. Biol 38:1071-80), 
and chemicals such as methyl jasmonate or salicylic acid (Gate et al. (1997) Plant Mol Biol 48: 89- 
108). In addition, the timing of the expression can be controlled by using promoters such as those 
acting at senescence (An and Amazon (1995) Science 270: 1986-1988); or late seed development 
5 (Odell et al. (1994) Plant Physiol 106:447-458). 

Plant expression vectors can also include RNA processing signals that can be 
positioned within, upstream or downstream of the coding sequence. In addition, the expression 
vectors can include additional regulatory sequences from the 3 '-untranslated region of plant 
genes, e.g., a 3' terminator region to increase mRNA stability of the mRNA, such as the PI-II 
10 terminator region of potato or the octopine or nopaline synthase 3* terminator regions. 

Additional Expression Elements 

Specific initiation signals can aid in efficient translation of coding sequences. 
These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where 
a coding sequence, its initiation codon and upstream sequences are inserted into the appropriate 

1 5 expression vector, no additional translational control signals may be needed. However, in cases 
where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is 
inserted, exogenous transcriptional control signals including the ATG initiation codon can be 
separately provided. The initiation codon is provided in the correct reading frame to facilitate 
transcription. Exogenous transcriptional elements and initiation codons can be of various origins, 

20 both natural and synthetic. The efficiency of expression can be enhanced by the inclusion of 
enhancers appropriate to the cell system in use. 

Expression Hosts 

The present invention also relates to host cells which are transduced with vectors 
of the invention, and the production of polypeptides of the invention (including fragments 

25 thereof) by recombinant techniques. Host cells are genetically engineered (i.e, nucleic acids are 
introduced, e.g., transduced, transformed or transfected) with the vectors of this invention, which 
may be, for example, a cloning vector or an expression vector comprising the relevant nucleic 
acids herein. The vector is optionally a plasmid, a viral particle, a phage, a naked nucleic acids, 
etc. The engineered host cells can be cultured in conventional nutrient media modified as 

30 appropriate for activating promoters, selecting transformants, or amplifying the relevant gene. 
The culture conditions, such as temperature, pH and the like, are those previously used with the 
host cell selected for expression, and will be apparent to those skilled in the art and in the 
references cited herein, including, Sambrook and Ausubel. 
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The host cell can be a eukaryotic cell, such as a yeast cell, or a plant cell, or the 
host cell can be a prokaryotic cell, such as a bacterial cell. Plant protoplasts are also suitable for 
some applications. For example, the DNA fragments are introduced into plant tissues, cultured 
plant cells or plant protoplasts by standard methods including electroporation (Fromm et al., 
5 (1985) Procjjafl Acad. Sci. USA 82, 5824, infection by viral vectors such as cauliflower mosa.c 
virus (CaMV) (Hohn et al., (1982) Molecular Biology of Plant Tumors , (Academic Press, New 
York) pp. 549-560; US 4,407,956), high velocity ballistic penetration by small particles with the 
nucleic acid either within the matrix of small beads or particles, or on the surface (Klein et al., 
(1987) Nature 327, 70-73), use of pollen as vector (WO 85/01856), or use of Agrobacterium 
10 tumefaciens or A. rhizogenes carrying a T-DNA plasmid in which DNA fragments are cloned. 
The T-DNA plasmid is transmitted to plant cells upon infection by Agrobacterium tumefaciens, 
and a portion is stably integrated into the plant genome (Horsch et al. (1984) Sdence_233:496- 
498; Fraley et al. (1983) P«™ Natl. Aca d, Sci. USA 80, 4803). 

The cell can include a nucleic acid of the invention which encodes a polypeptide, 
15 wherein the cells expresses a polypeptide of the invention. The cell can also include vector 

sequences, or the like. Furthermore, cells and transgenic plants which include any polypeptide or 
nucleic acid above or throughout this specification, e.g., produced by transduction of a vector of 
the invention, are an additional feature of the invention. 

For long-term, high-yield production of recombinant proteins, stable expression 
20 can be used. Host cells transformed with a nucleotide sequence encoding a polypeptide of the 
invention are optionally cultured under conditions suitable for the expression and recovery of the 
encoded protein from cell culture. The protein or fragment thereof produced by a recombinant 
cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
25 containing polynucleotides encoding mature proteins of the invention can be designed with signal 
sequences which direct secretion of the mature polypeptides through a prokaryotic or eukaryotic 
cell membrane. 

Modified Amino Acids . n ., 

Polypeptides of the invention may contain one or more modified amino acids. 

30 The presence of modified amino acids may be advantageous in, for example, increasing 

polypeptide half-life, reducing polypeptide antigenicity or toxicity, increasing polypeptide storage 

stability, or the like. Amino acid(s) are modified, for example, co-translationaUy or post- 

translationally during recombinant production or modified by synthetic or chemical means. 
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Non-limiting examples of a modified amino acid include incorporation or other 
use of acetylated amino acids, glycosylated amino acids, sulfated amino acids, prenylated (e.g., 
farnesylated, geranylgeranylated) amino acids, PEG modified (e.g., "PEGylated") amino acids, 
biotinylated amino acids, carboxylated amino acids, phosphorylated amino acids, etc. References 
5 adequate to guide one of skill in the modification of amino acids are replete throughout the 
literature. 

IDENTIFICATION OF ADDITIONAL FACTORS 

A transcription factor provided by the present invention can also be used to 
identify additional endogenous or exogenous molecules that can affect a phentoype or trait of 

10 interest. On the one hand, such molecules include organic (small or large molecules) and/or 
inorganic compounds that affect expression of (i.e., regulate) a particular transcription factor. 
Alternatively, such molecules include endogenous molecules that are acted upon either at a 
transcriptional level by a transcription factor of the invention to modify a phenotype as desired. 
For example, the transcription factors can be employed to identify one or more downstream gene 

1 5 with which is subject to a regulatory effect of the transcription factor. In one approach, a 

transcription factor or transcription factor homologue of the invention is expressed in a host cell, 
e.g, a transgenic plant cell, tissue or explant, and expression products, either RNA or protein, of 
likely or random targets are monitored, e.g., by hybridization to a microarray of nucleic acid 
probes corresponding to genes expressed in a tissue or cell type of interest, by two-dimensional 

20 gel electrophoresis of protein products, or by any other method known in the art for assessing 
expression of gene products at the level of RNA or protein. Alternatively, a transcription factor 
of the invention can be used to identify promoter sequences (i.e., binding sites) involved in the 
regulation of a downstream target. After identifying a promoter sequence, interactions between 
the transcription factor and the promoter sequence can be modified by changing specific 

25 nucleotides in the promoter sequence or specific amino acids in the transcription factor that 
interact with the promoter sequence to alter a plant trait. Typically, transcription factor DNA 
binding sites are identified by gel shift assays. After identifying the promoter regions, the 
promoter region sequences can be employed in double-stranded DNA arrays to identify 
molecules that affect the interactions of the transcription factors with their promoters (Bulyk et al. 

30 (1999) Nature Biotechnology 17:573-577). 

The identified transcription factors are also useful to identify proteins that modify 
the activity of the transcription factor. Such modification can occur by covalent modification, 
such as by phosphorylation, or by protein-protein (homo or-heteropolymer) interactions. Any 
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20 rnFNTTPlC ATI ™ OF MO™ JLATORS 

In addition to the intracellular molecules described above, extracellular 

m o,ecules that alter activity or expression 

can be identified. For example, the methods can entail first placing a candidate molecule « 
contact withaplantorplant cell. The molecule can be introduced by topical admimstratton such 

25 as spraying or soaking of a plant, and then the molecule's effect on the expression c, r actmty of 
th^TF polypeptide or the expression of the polynucleotide monitored. Changes in the expression 
of the TF polypeptide can be monitored by use of polyclonal or monoclonal antxbod.es, gel 
electrophoresis or the like. Changes in the expression of the corresponding polynucleotide 
sequence can be detected by use of microarrays, Northerns, quantitative PCR, or any other 

30 techniqueformonitormgchangesinmRNAexpression. These techniques are exempted « 
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changes in the expression levels canbe correlated with modified plant traits and thus tdenUfied 
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molecules can be useful for soaking or spraying on fruit, vegetable and grain crops to modify 
traits in plants. 

Essentially any available composition can be tested for modulatory activity of 
expression or activity of any nucleic acid or polypeptide herein. Thus, available libraries of 
5 compounds such as chemicals, polypeptides, nucleic acids and the like can be tested for 
modulatory activity. Often, potential modulator compounds can be dissolved in aqueous or 
organic (e.g., DMSO-based) solutions for easy delivery to the cell or plant of interest in which the 
activity of the modulator is to be tested. Optionally, the assays are designed to screen large 
modulator composition libraries by automating the assay steps and providing compounds from 

1 0 any convenient source to assays, which are typically run in parallel (e.g., in microtiter formats on 
microtiter plates in robotic assays). 

In one embodiment, high throughput screening methods involve providing a 
combinatorial library containing a large number of potential compounds (potential modulator 
compounds). Such "combinatorial chemical libraries" are then screened in one or more assays, as 

1 5 described herein, to identify those library members (particular chemical species or subclasses) 
that display a desired characteristic activity. The compounds thus identified can serve as target 
compounds. 

A combinatorial chemical library can be, e.g., a collection of diverse chemical 
compounds generated by chemical synthesis or biological synthesis. For example, a 

20 combinatorial chemical library such as a polypeptide library is formed by combining a set of 
chemical building blocks (e.g., in one example,'amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound of a set length). 
Exemplary libraries include peptide libraries, nucleic acid libraries, antibody libraries (see, e.g., 
Vaughn et al. (1996) Nature Biotechnology. 14(3):309-314 and PCT/US96/10287), carbohydrate 

25 libraries (see, e.g., Liang et al. Science (1996) 274:1520-1522 and U.S. Patent 5,593,853), 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), and small organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&EN Jan 18, page 33 (1993); isoprenoids, U.S. 
Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent 5,549,974; pyrrolidines, U.S. 
Patents 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent 5,506,337) and the like. 

30 Preparation and screening of combinatorial or other libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. Res. 37:487-493 
(1991) and Houghton et al. Nature 354:84-88 (1991)). Other chemistries for generating chemical 
diversity libraries can also be used. 
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I„ addition, as noted, compound screening equipment for high-throughput 

have alsobeendeveloped for so.utionphasechemist.es useful ^^^^ 
include automated workstations including an automated synthesis apparatus and robonc sys^ms 

eg forhigh-throughputscreeningofpotent.almodulators. The nature and .mplemen^on of 
edifications to these devices (if any) so that they can operate as discussed herein wtll be 
apparent to persons skilled in the relevant art. 

Indeed, entire high throughput screening systems are commercmlly avaUable. 
, These systems typically automate entire procedures including all sample and reagent pipettmg, 
liquid dispensing, timed incubations, and final readings of the microplate in detectors) 
appropriate for the assay. These configurable systen. provide mgh throughput and rap.d s^u^ 
as well as a high degree of flexibility and customization. Similarly, microfluidic .mplementataons 
of screening are also commercially available. 
5 The manufacturers of such systems provide detailed protocols the various lugh 

throughput. Thus, for example, Zymark Corp. provides technical bulletins describing screemng 
systems for detecting the modulation of gene transcription, ligand binding, and the hke. The 
integrated systems herein, in addition to providing for sequence alignment and, opfonally 
synthesis of relevant nucleic acids, can include such screening apparatus to identify modulators 
0 that have an effect on one or more polynucleotides or polypeptides according to the present 
invention. 

In some assays it is desirable to have positive controls to ensure that the 
components of the assays are working properly. At least two types of positive controls are 
appropriate. That is, know, transcriptional activators or inhibitors can be incubated wUh 
>5 cells/plants/ etc. in one sample of the assay, and the resulting increase/decrease in transcnp.cn 
can be detected by measuring the resulnng increase in RNA/ protein expression, etc., accordmg to 
the methods herein. It will be appreciated that modulators can also be combined wuh 
transcriptional activators or inhibitors to find modulators which inhibit transcriptional action 
or transcriptional repression. Either expression of the nucleic acids and proteins herein or any 
additional nucleic acids or proteins activated by the nucleic acids or proteins herein, or both, can 
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In an embodiment, me invention provides a method for idenUfymg composes 

that modulate the activity or expression of a polynucleotide or polypeptide of the invention For 

example, a test compound, whether a small or large molecule, is placed in contact wth a cell, 
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plant (or plant tissue or explant), or composition comprising the polynucleotide or polypeptide of 
interest and a resulting effect on the cell, plant, (or tissue or explant) or composition is evaluated 
by monitoring, either directly or indirectly, one or more of: expression level of the polynucleotide 
or polypeptide, activity (or modulation of the activity) of the polynucleotide or polypeptide. In 
some cases, an alteration in a plant phenotype can be detected following contact of a plant (or 
plant cell, or tissue or explant) with the putative modulator, e.g., by modulation of expression or 
activity of a polynucleotide or polypeptide of the invention. 

SUBSEQUENCES 

Also contemplated are uses of polynucleotides, also referred to herein as 

oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably at least 

20, 30, or 50 bases, which hybridize under at least highly stringent (or ultra-high stringent or 

ultra-ultra- high stringent conditions) conditions to a polynucleotide sequence described above. 

The polynucleotides may be used as probes, primers, sense and antisense agents, and the like, 

according to methods as noted supra. 

Subsequences of the polynucleotides of the invention, including polynucleotide 

fragments and oligonucleotides are useful as nucleic acid probes and primers. An oligonucleotide 

suitable for use as a probe or primer is at least about 15 nucleotides in length, more often at least 

about 18 nucleotides, often at least about 21 nucleotides, frequently at least about 30 nucleotides, 

or about 40 nucleotides, or more in length. A nucleic acid probe is useful in hybridization 

protocols, e.g., to identify additional polypeptide homologues of the invention, including 

protocols for microarray experiments. Primers can be annealed to a complementary target DNA 

strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA 

strand, and then extended along the target DNA strand by a DNA polymerase enzyme. Primer 

pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain 

reaction (PCR) or other nucleic-acid amplification methods. See Sambrook and Ausubel, supra. 

In addition, the invention includes an isolated or recombinant polypeptide 

including a subsequence of at least about 15 contiguous amino acids encoded by the recombinant 

or isolated polynucleotides of the invention. For example, such polypeptides, or domains or 

fragments thereof, can be used as immunogens, e.g., to produce antibodies specific for the 

polypeptide sequence, or as probes for detecting a sequence of interest. A subsequence can range 

in size from about 15 amino acids in length up to and including the full length of the polypeptide. 
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the introduced antisense sequence in the vector will be at least 30 nucleotides in length, and 
improved antisense suppression will typically be observed as the length of the antisense sequence 
increases. Preferably, the length of the antisense sequence in the vector will be greater than 100 
nucleotides. Transcription of an antisense construct as described results in the production of 
5 RNA molecules that are the reverse complement of mRNA molecules transcribed from the 
endogenous transcription factor gene in the plant celL 

Suppression of endogenous transcription factor gene expression can also be 
achieved using a ribozyme. Ribozymes are RNA molecules that possess highly specific 
endoribonuclease activity. The production and use of ribozymes are disclosed in U.S. Patent No. 

1 0 4,987,071 and U.S. Patent No. 5,543,508. Synthetic ribozyme sequences including antisense 

RNAs can be used to confer RNA cleaving activity on the antisense RNA, such that endogenous 
mRNA molecules that hybridize to the antisense RNA are cleaved, which in turn leads to an 
enhanced antisense inhibition of endogenous gene expression. 

Vectors in which RNA encoded by a transcription factor or transcription factor 

1 5 homologue cDNA is over-expressed can also be used to obtain co-suppression of a corresponding 
endogenous gene, e.g., in the manner described in U.S. Patent No. 5,231,020 to Jorgensen. Such 
co-suppression (also termed sense suppression) does not require that the entire transcription factor 
cDNA be introduced into the plant cells, nor does it require that the introduced sequence be 
exactly identical to the endogenous transcription factor gene of interest. However, as with 

20 antisense suppression, the suppressive efficiency will be enhanced as specificity of hybridization 
is increased, e.g., as the introduced sequence is lengthened, and/or as the sequence similarity 
between the introduced sequence and the endogenous transcription factor gene is increased. 

Vectors expressing an untranslatable form of the transcription factor mRNA, e.g., 
sequences comprising one or more stop codon, or nonsense mutation) can also be used to 

25 suppress expression of an endogenous transcription factor, thereby reducing or eliminating it's 
activity and modifying one or more traits. Methods for producing such constructs are described 
in U.S. Patent No. 5,583,021. Preferably, such constructs are made by introducing a premature 
stop codon into the transcription factor gene. Alternatively, a plant trait can be modified by gene 
silencing using double-strand RNA (Sharp (1999) Genes and Development 13: 139-141). 

30 Another method for abolishing the expression of a gene is by insertion 

mutagenesis using the T-DNA of Agrobacterium tumefaciens. After generating the insertion 
mutants, the mutants can be screened to identify those containing the insertion in a transcription 
factor or transcription factor homologue gene. Plants containing a single transgene insertion 
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event at the desired gene can be crossed to generate homozygous plants for the mutation (Koncz 
etal (1992) Methods IS &rabjdgjsis Research, World Scientific). 

Alternatively, a plant phenotype can be altered by eliminating an endogenous 
gene, such as a transcription factor or transcription factor homologue, e.g., by homologous 
5 recombination (Kempin et al. (1997) Nature 389:802). 

A plant trait can also be modified by using the cre-lox system (for example, as 
described in US Pat. No. 5,658,772). A plant genome can be modified to include first and 
second lax sites that are then contacted with a Cre recombinase. If the lox sites are in the same 
• orientation, the intervening DNA sequence between the two sites is excised. If the lox sites are m 
1 0 the opposite orientation, the intervening sequence is inverted. 

The polynucleotides and polypeptides of this invention can also be expressed m a 
plant in the absence of an expression cassette by manipulating the activity or expression level of 
the endogenous gene by other means. For example, by ectopically expressing a gene by T-DNA 
activation tagging (Ichikawa et al. (1997) Nature 390 698-701; Kakimoto et al. (1996) Science 
15 274- 982-985) This method entails transforming a plant with a gene tag containing multiple 

transcriptional enhancers and once the tag has inserted into the genome, expression of a flanking 
gene coding sequence becomes deregulated. In another example, the transcriptional machinery ,n 
a plant can be modified so as to increase transcription levels of a polynucleotide of the invention 
{See e g PCT Publications WO 96/06166 and WO 98/53057 which describe the modification of 
20 the DNA binding specificity of zinc finger proteins by changing particular amino acids in the 

DNA binding motif). 

The transgenic plant can also include the machinery necessary for expressing or 
altering the activity of a polypeptide encoded by an endogenous gene, for example by altering the 
phosphorylation state of the polypeptide to maintain it in an activated state. 

25 Transgenic plants (or plant cells, or plant explants, or plant tissues) incorporating 

the polynucleotides of the invention and/or expressing the polypeptides of the invention can be 
produced by a variety of well established techniques as described above. Following constructs 
of a vector, most typically an expression cassette, including a polynucleotide, e.g., encoding a 
transcription factor or transcription factor homologue, of the invention, standard techniques can 

30 be used to introduce the polynucleotide into a plant, a plant cell, a plant explant or a plant tissue 
of interest. Optionally, the plant cell, explant or tissue can be regenerated to produce a transgenic 
plant. 

The plant can be any higher plant, including gymnosperms, monocotyledonous 
and dicotyledenous plants. Suitable protocols are available for Leguminosae (alfalfa, soybean, 



29 



WO 01/36444 



PCT/US00/31325 



clover, etc.), Umbelliferae (carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, 
broccoli, etc.), Curcurbitaceae (melons and cucumber), Gramineae (wheat, corn, rice, barley, 
millet, etc.), Soianaceae (potato, tomato, tobacco, peppers, etc.), and various other crops. See 
protocols described in Ammirato et al. (1984) Handbook of Plant Cell Cu lture -Crop Species. 
5 Macmillan Publ. Co. Shimamoto et al. (1989) Nature 338:274-276; Fromm et al. (1990) 
Bio/Technology 8:833-839; and Vasil et al. (1990) Bio/Technology 8:429-434. 

Transformation and regeneration of both monocotyledonous and dicotyledonous 
plant cells is now routine, and the selection of the most appropriate transformation technique will 
be determined by the practitioner. The choice of method will vary with the type of plant to be 

10 transformed; those skilled in the art will recognize the suitability of particular methods for given 
plant types. Suitable methods can include, but are not limited to: electroporation of plant 
protoplasts; Hposome-mediated transformation; polyethylene glycol (PEG) mediated 
transformation; transformation using viruses; micro-injection of plant cells; micro-projectile 
bombardment of plant cells; vacuum infiltration; and Agrobacterium iumeficiens mediated 

1 5 transformation. Transformation means introducing a nucleotide sequence in a plant in a manner to 
cause stable or transient expression of the sequence. 

Successful examples of the modification of plant characteristics by 
transformation with cloned sequences which serve to illustrate the current knowledge in this field 
of technology, and which are herein incorporated by reference, include: U.S. Patent Nos. 

20 5,571,706; 5,677,175; 5,510,471; 5,750,386; 5,597,945; 5,589,615; 5,750,871; 5,268,526; 
5,780,708; 5,538,880; 5,773,269; 5,736,369 and 5,610,042. 

Following transformation, plants are preferably selected using a dominant 
selectable marker incorporated into the transformation vector. Typically, such a marker will 
confer antibiotic or herbicide resistance on the transformed plants, and selection of transformants 

25 can be accomplished by exposing the plants to appropriate concentrations of the antibiotic or 
herbicide. 

After transformed plants are selected and grown to maturity, those plants 
showing a modified trait are identified. The modified trait can be any of those traits described 
above. Additionally, to confirm that the modified trait is due to changes in expression levels or 
30 activity of the polypeptide or polynucleotide of the invention can be determined by analyzing 
mRNA expression using Northern blots, RT-PCR or microarrays, or protein expression using 
immunoblots or Western blots or gel shift assays. 
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INTEGRATED SYSTEMS — SEQUENC E IDENTITY 

Additionally, the present invention may be an integrated system, computer or 
computer readable medium that comprises an instruction set for determining the identity of one or 
more sequences in a database. In addition, the instruction set can be used to generate or identify 

5 sequences that meet any specified criteria. Furthermore, the instruction set may be used to 
associate or link certain functional benefits, such modified structure and development 
characteristics, with one or more identified sequence. 

For example, the instruction set can include, e.g., a sequence comparison or other 
alignment program, e.g., an available program such as, for example, the Wisconsin Package 

1 0 Version 10.0, such as BLAST, FAST A, PILEUP, FINDPATTERNS or the like (GCG, Madision, 
WI). Public sequence databases such as GenBank, EMBL, Swiss-Prot and PIR or private 
sequence databases such as PhytoSeq (Incyte Pharmaceuticals, Palo Alto, CA) can be searched. 

Alignment of sequences for comparison can be conducted by the local homology 
algorithm of Smith and Waterman (198 1) Adv. Annl. Math. 2:482, by the homology alignment 

15 algorithm of Needleman and Wunsch (1970) J.Mol.Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Proc Natl. Acad. Sci. U.S.A. 85: 2444, by computerized 
implementations of these algorithms. After alignment, sequence comparisons between two (or 
more) polynucleotides or polypeptides are typically performed by comparing sequences of the 
two sequences over a comparison window to identify and compare local regions of sequence 

20 similarity. The comparison window can be a segment of at least about 20 contiguous positions, 
usually about 50 to about 200, more usually about 100 to about 150 contiguous positions. A 
description of the method is provided in Ausubel et al., supra. 

A variety of methods of determining sequence relationships can be used, 
including manual alignment and computer assisted sequence alignment and analysis. This later 

25 approach is a preferred approach in the present invention, due to the increased throughput 
afforded by computer assisted methods. As noted above, a variety of computer programs for 
performing sequence alignment are available, or can be produced by one of skill. 

One example algorithm that is suitable for determining percent sequence identity 
and sequence similarity is the BLAST algorithm, which is described in Altschul et al. J. Mol. Biol 

30 215:403-410 (1990). Software for performing BLAST analyses is publicly available, e.g., 

through the National Center for Biotechnology Information (http://ww.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
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referred to as the neighborhood word score threshold (Altschul et al., supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. 
The word hits are then extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, for 
5 nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 
0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a 
scoring matrix is used to calculate the cumulative score. Extension of the word hits in each 
direction are halted when: the cumulative alignment score falls off by the quantity X from its 
maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of 

10 one or more negative-scoring residue alignments; or the end of either sequence is reached. The 
BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. 
The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an 
expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino 
acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) 

1 5 of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff H989) Proc. Natl. Acad. 
Sci. USA 89:10915). 

In addition to calculating percent sequence identity, the BLAST algorithm also 
performs a statistical analysis of the similarity between two sequences (see t e.g., .Karlin & 
Altschul (1993^ Proc. Natl. Acad. Sci. USA 90:5873-5787). One measure of similarity provided 

20 by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of 
the probability by which a match between two nucleotide or amino acid sequences would occur 
by chance. For example, a nucleic acid is considered similar to a reference sequence (and, 
therefore, in this context, homologous) if the smallest sum probability in a comparison of the test 
nucleic acid to the reference nucleic acid is less than about 6.1, or less than about 0.01, and or 

25 even less than about 0.001 . An additional example of a useful sequence alignment algorithm is 
PILEUP. PDLEUP creates a multiple sequence alignment from a group of related sequences using 
progressive, pairwise alignments. The program can align, e.g., up to 300 sequences of a 
maximum length of 5,000 letters. 

The integrated system, or computer typically includes a user input interface 

30 allowing a user to selectively view one or more sequence records corresponding to the one or 

more character strings, as well as an instruction set which aligns the one or more character strings 
with each other or with an additional character string to identify one or more region of sequence 
similarity. The system may include a link of one or more character strings with a particular 
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phenotype or gene function. Typically, the system includes a user readable output element which 
displays an alignment produced by the alignment instruction set. 

The methods of this invention can be implemented in a localized or distributed 
computing environment. In a distributed environment, the methods may implemented on a single 
5 computer comprising multiple processors or on a multiplicity of computers. The computers can 
be linked, e.g. through a common bus, but more preferably the computers) are nodes on a 
network. The network can be a generalized or a dedicated local or wide-area network and, in 
certain preferred embodiments, the computers may be components of an intra-net or an internet 
Thus, the invention provides methods for identifying a sequence similar or 

10 homologous to one or more polynucleotides as noted herein, or one or more target polypeptides 
encoded by the polynucleotides, or otherwise noted herein and may include linking or associating 
a given plant phenotype or gene function with a sequence. In the methods, a sequence database is 
provided (locally or across an inter or intra net) and a query is made against the sequence 
database using the relevant sequences herein and associated plant phenotypes or gene functions. 

1 5 Any sequence herein can be entered into the database, before or after querying 

the database. This provides for both expansion of the database and, if done before the querying 
step, for insertion of control sequences into the database. The control sequences can be detected 
by the query to ensure the general integrity of both the database and the query. As noted, the 
query can be performed using a web browser based interface. For example, the database can be a 

20 centralized public database such as those noted herein, and the querying can be done from a 
remote terminal or computer across an internet or intranet. 

EXAMPLES 

The following examples are intended to illustrate but not limit the present 

invention. 

25 EXAMPLE I. FULL LENGTH GENE IDENTIFICATION AND CLONING 

Putative transcription factor sequences (genomic or ESTs) related to known 
transcription factors were identified in the Arabidopsis thaliana GenBank database using the 
tblastn sequence analysis program using default parameters and a P-value cutoff threshold of -4 
or -5 or lower, depending on the length of the query sequence. Putative transcription factor 

30 sequence hits were then screened to identify those containing particular sequence strings. If the 
sequence hits contained such sequence strings, the sequences were confirmed as transcription 
factors. 
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Alternatively, Arabidopsis thaliana cDNA libraries derived from different tissues 
or treatments, or genomic libraries were screened to identify novel members of a transcription 
family using a low stringency hybridization approach. Probes were synthesized using gene 
specific primers in a standard PGR reaction (annealing temperature 60° C) and labeled with 32 P 
5 dCTP using the High Prime DNA Labeling Kit (Boehringer Mannheim). Purified radiolabeled 
probes were added to filters immersed in Church hybridization medium (0.5 M NaP0 4 pH 7.0, 
7% SDS, 1 % w/v bovine serum albumin) and hybridized overnight at 60 °C with shaking. Filters 
were washed two times for 45 to 60 minutes with lxSCC, 1% SDS at 60° C. 

To identify additional sequence 5' or 3' of a partial cDNA sequence in a cDNA 

1 0 library, 5* and 3' rapid amplification of cDNA ends (RACE) was performed using the Marathon™ 
cDNA amplification kit (Clontech, Palo Alto, CA). Generally, the method entailed first isolating 
poly(A) mRNA, performing first and second strand cDNA synthesis to generate double stranded 
cDNA, blunting cDNA ends, followed by ligation of the Marathon™ Adaptor to the cDNA to 
form a library of adaptor-ligated ds cDNA. 

15 Gene-specific primers were designed to be used along with adaptor specific 

primers for both 5* and V RACE reactions. Nested primers, rather than single primers, were used 
to increase PCR specificity. Using 5' and 3' RACE reactions, 5* and 3* RACE fragments were 
obtained, sequenced and cloned. The process can be repeated until 5' and 3' ends of the full- 
length gene were identified. Then the full-length cDNA was generated by PCR using primers 

20 specific to 5 ' and 3' ends of the gene by end-to-end PCR. 

EXAMPLE II. CONSTRUCTION OF EXPRESSION VECTORS 

The sequence was amplified from a genomic or cDNA library using primers 
specific to sequences upstream and downstream of the coding region. The expression vector was 
pMEN20 or pMEN65, which are both derived from pMON316 (Sanders et al, (1987) Nucleic 

25 Acids Research 15:1543-58) and contain the CaMV 35S promoter to express transgenes. To 

clone the sequence into the vector, both pMEN20 and the amplified DNA fragment were digested 
separately with Sail and NotI restriction enzymes at 37° C for 2 hours. The digestion products 
were subject to electrophoresis in a 0.8% agarose gel and visualized by ethidium bromide 
staining. The DNA fragments containing the sequence and the linearized plasmid were excised 

30 and purified by using a Qiaquick gel extraction kit (Qiagen, CA). The fragments of interest were 
ligated at a ratio of 3 : 1 (vector to insert). Ligation reactions using T4 DNA ligase (New England 
Biolabs, MA) were carried out at 16° C for 16 hours. The ligated DNAs were transformed into 
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competent cells of the E. coli strain DH5alpha by using the heat shock method. The 
transformations were plated on LB plates containing 50 mg/1 kanamycin (Sigma). 

Individual colonies were grown overnight in five milliliters of LB broth 
containing 50 mg/1 kanamycin at 37" C. Plasmid DNA was purified by using Qiaquick Mini 
5 Prep kits (Qiagen, CA). 

pyampi F TTT TRANSFOR MATION OF AGR ™* CTRRIUM WTTH THE EXPRESSION 
VECTOR 

After the plasmid vector containing the gene was constructed, the vector was 
used to transform Agrobacterium tumefaciens cells expressing the gene products. The stock of 

10 Agrobacterium tumefaciens cells for transformation were made as described by Nagel et al. 

(1990) FF.MS Microbiol Letts . 67: 325-328. Agrobacterium strain ABI was grown in 250 ml LB 
medium (Sigma) overnight at 28°C with shaking until an absorbance (A«oo) of 0.5 - 1 .0 was 
reached. Cells were harvested by centrifugation at 4,000 x g for 15 min at 4°C. Cells were then 
resuspended in 250 pi chilled buffer (1 mM HEPES, pH adjusted to 7.0 with KOH). Cells were 

15 centrifuged again as described above and resuspended in 125 pi chilled buffer. Cells were then 
centrifuged and resuspended two more times in the same HEPES buffer as described above at a 
volume of 100 pi and 750 pi, respectively. Resuspended cells were then distributed into 40 pi 
aliquots, quickly frozen in liquid nitrogen, and stored at -80° C. 

Agrobacterium cells were transformed with plasmids prepared as described 

20 above following the protocol described by Nagel et al. For each DNA construct to be 

transformed, 50 - 100 ng DNA (generally resuspended in 10 mM Tris-HCl, 1 mM EDTA, pH 
8.0) was mixed with 40 pi of Agrobacterium cells. The DNA/cell mixture was then transferred to 
a chilled cuvette with a 2mm electrode gap and subject to a 2.5 kV charge dissipated at 25 pF and 
200 pF using a Gene Pulser H apparatus (Bic-Rad). After electroporation, cells were 

25 immediately resuspended in 1.0 ml LB and allowed to recover without antibiotic selection for 2 - 
4 hours at 28° C in a shaking incubator. After recovery, cells were plated onto selective medium 
of LB broth containing 100 pg/ml spectinomycin (Sigma) and incubated for 24-48 hours at 28° C. 
Single colonies were then picked and inoculated in fresh medium. The presence of the plasmid 
construct was verified by PCR amplification and sequence analysis. 

30 EXAMPLE IV. TR ANSFORMATION OF ARABIDO P SR PLANTS WITH AGROBACTERIUM 
77 IMF.FACIENS WITH EXPRESSI ON VECTOR 

After transformation of Agrobacterium tumefaciens with plasmid vectors 
containing the gene, single Agrobacterium colonies were identified, propagated, and used to 
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transform Arabidopsis plants. Briefly, 500 ml cultures of LB medium containing 50 mg/1 
kanamycin were inoculated with the colonies and grown at 28° C with shaking for 2 days until an 
absorbance (A«)o) of > 2.0 is reached. Cells were then harvested by centrifugation at 4,000 x g 
for 10 min, and resuspended in infiltration medium (1/2 X Murashige and Skoog salts (Sigma), 1 
5 X Gamborg's B-5 vitamins (Sigma), 5.0% (w/v) sucrose (Sigma), 0.044 \M benzylamino purine 
(Sigma), 200 \iVL Silwet L-77 (Lehle Seeds) until an absorbance (A^oo) of 0.8 was reached. 

Prior to transformation, Arabidopsis thaliana seeds (ecotype Columbia) were 
sown at a density of -10 plants per 4" pot onto Pro-Mix BX potting medium (Hummert 
International) covered with fiberglass mesh (18 mm X 16 mm). Plants were grown under 

10 continuous illumination (50-75 jiE/m 2 /sec) at 22-23° C with 65-70% relative humidity. After 
about 4 weeks, primary inflorescence stems (bolts) are cut off to encourage growth of multiple 
secondary bolts. After flowering of the mature secondary bolts, plants were prepared for 
transformation by removal of all siliques and opened flowers. 

The pots were then immersed upside down in the mixture of Agrobacterium 

1 5 infiltration medium as described above for 30 sec, and placed on their sides to allow draining into 
a T x 2' flat surface covered with plastic wrap. After 24 h, the plastic wrap was removed and 
pots are turned upright. The immersion procedure was repeated one week later, for a total of two 
immersions per pot. Seeds were then collected from each transformation pot and analyzed 
following the protocol described below. 

20 EXAMPLE V. IDENTIFICATION OF ARABIDOPSIS PRIMARY TRANSFORMANTS 
Seeds collected from the transformation pots were sterilized essentially as 
follows. Seeds were dispersed into in a solution containing 0.1% (v/v) Triton X-100 (Sigma) and 
sterile H 2 0 and washed by shaking the suspension for 20 min. The wash solution was then 
drained and replaced with fresh wash solution to wash the seeds for 20 min with shaking. After 

25 removal of the second wash solution, a solution containing 0.1% (v/v) Triton X-100 and 70% 
ethanol (Equistar) was added to the seeds and the suspension was shaken for 5 min. After 
. removal of the ethanol/detergent solution, a solution containing 0.1% (v/v) Triton X-100 and 30% 
(v/v) bleach (Clorox) was added to the seeds, and the suspension was shaken for 10 min. After 
removal of the bleach/detergent solution, seeds were then washed five times in sterile distilled 

30 H 2 0. The seeds were stored in the last wash water at 4° C for 2 days in the dark before being 

plated onto antibiotic selection medium (1 X Murashige and Skoog salts (pH adjusted to 5.7 with 
1M KOH), 1 X Gamborg's B-5 vitamins, 0.9% phytagar (Life Technologies), and 50 mg/1 
kanamycin). Seeds were germinated under continuous illumination (50-75 jiE/m 2 /sec) at 22-23° 



36 



PCT/USOO/31325 

WO 01/36444 

C. After 7-10 days of growth under these conditions, kanamycin resistant primary transformants 
(T, generation) were visible and obtained. These seedlings were transferred first to fresh 
selection plates where the seedlings continued to grow for 3-5 more days, and then to soil (Pro- 
Mix BX potting medium). 

Primary transformants were crossed and progeny seeds (Tj) collected; kanamycin 
resistant seedlings were selected and analyzed. The expression levels of the recombinant 
polynucleotides in the transformants varies from about a 5% expression level increase to a least a 
100% expression level increase. Similar observations are made with respect to polypeptide level 
expression. 



10 



EXAMPI - VI IDEmglCATION OF A? APTOOPSTS PLANTS WITH JRANSCTIPTION 
F A £TQB KNOCKOUTS 

The screening of insertion mutagenized Arabidopsis collections for null mutants 
in a known target gene was essentially as described in Krysan et al (1999) Plant£ell 1 1:2283: 

1 5 2290. Briefly, gene-specific primers, nested by 5-250 pb to each others, were designed from the 
5' and 3» regions of a known target gene. Similarly, nested sets of primers were also created 
specific to each of the T-DNA or transposon ends (the "right" and "left" borders). All possible 
combinations of gene specific and T-DNA/transposon primers were used to detect by PCR an 
insertion event within or close to the target gene. The amplified DNA fragments were then 

20 sequenced which allows the precise determination of the T-DNA/transposon insertion pomt 
relative to the target gene. Insertion events within the coding or intervening sequence of the 
genes were deconvolved from a pool comprising a plurality of insertion events to a single unique 
mutant plant for functional characterization. The method is described in more detail in Yu and 
Adam, US Application Serial No. 09/177,733 filed October 23, 1998. 

o< rY A>fmrVTT TTF>rr ^PATir>N OF STRUCT iTBP ANT) DEVELOPMENT 

25 ^^^ SnTVPP IN OVEREXPPFSSOF O P OKNH KNOCKOUT 
PLANTS 

Experiments were performed to identify those transformants or knockouts that 
exhibited a modified structure and development characteristics. For such studies, the 
30 transformants were observed by eye to identify novel structural or developmental characteristics 
associated with the ectopic expression of the polynucleotides or polypeptides of the invention. 

Table 3 shows the phenotypes observed for particular overexpressor or knockout 
plants and provides the SEQ ID No., the internal reference code (GID), whether a knockout or 
overexpressor plant was analyzed and the observed phenotype. 
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Table 3 



SEQ ID No. 


GID 


Knockout (KO) or 
overexpressor (KO) 


Phenotype observed 


i 
i 


am 


OE 


Plants were small, and more dark green in 
color, late flowering and poorly fertile. 


i 


G712 


OE 


Plants were small and inflorescence was 
unelongated. Flowers parts appeared to be un- 
elongated and the plants were semi-sterile. 


5 


G9 


OE 


Increased root mass 


7 


G428 


OE 


Lobed and highly serrated leaves and 
abnormal first and second whorl floral organs 


9 


G869 


OE 


Undeveloped or small anthers 


It 


G1269 


OE 


Extended petioles and leaves pointed upwards 


13 


G1038 


OE 


Altered leaf shape 


15 


G438 


vr\ 
¥JJ 


Keaucea ugnin in stem 


17 


G571 


KO 


Delayed senescence at the end of the plant 
lifecycle 


19 


G748 


OE 


More vascular bundles in stem 


21 


G431 


OE 


Severe developmental abnormalities such as 
altered branching, twisted rosette leaves, 
flowers with missing pistils, fused stamens and 
atypical numbers of petals and stamens, 
reduced secondary bolts, and lack of cauline 
leaves. 


23 


G187 


OE 


Plants had long, thin cotyledons and reduced 
apical dominance. Several flower 
abnormalities, including underdeveloped, 
sepaloid petals and underdeveloped anthers 
were also observed. 


25 


G470 


OE 


Plants were sterile due to failure of anthers to 
elongate 


27 


G615 


OE 


Plants were sterile due to failure of anthers to 
develop and failure of stamens to elongate. 
Fused cotyledons and absence of a shoot apical 
meristem and true leaves was also observed. 


29 


G1073 


OE 


Increased plant size and serrated leaves 



For a particular overexpressor that shows a less beneficial structure and development 
characteristic, it may be more useful to select a plant with a decreased expression of the particular 
transcription factor. For a particular knockout that shows a less beneficial structure and 
development characteristic, it may be more useful to select a plant with an increased expression 
of the particular transcription factor. 
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FYAMPT F. vm IDEN TIFICATION OF HOMOLOGOUS SEQUENCES 

Homologous sequences from Arabidopsis and plant species other than Arabidopsis were 
identified using database sequence search tools, such as the Basic Local Alignment Search Tool 
(BLAST) (Altschul et al. (1990) J.Mol.Biol. 215:403410; and Altschul et al. (1997) Nucl. Acid 
5 Res. 25: 3389-3402). The tblastx sequence analysis programs were employed using the 

BLOSUM-62 scoring matrix (Henikoff, S. and Henikoff, J. G. (1992) Proc Natl. Acad. Sci. USA 
89: 10915-10919). 

Identified Arabidopsis homologous sequences are provided in Figure 2 and included in 
the Sequence Listing. The percent sequence identity among these sequences is as low as 47% 

10 sequence identity. Additionally, the entire NCBI GenBank database was filtered for sequences 
from all plants except Arabidopsis thaliana by selecting all entries in the NCBI GenBank 
database associated with NCBI taxonomic ID 33090 (Viridiplantae; all plants) and excluding 
entries associated with taxonomic ID 3701 {Arabidopsis thaliana). These sequences were 
compared to sequences representing genes of SEQ IDs Nos. 1-54 on 9/26/2000 using the 

15 Washington University TBLASTX algorithm (version 2.0al9MP). For each gene of SEQ IDs 
Nos. 1-54, individual comparisons were ordered by probability score (P-value), where the score 
reflects the probability that a particular alignment occurred by chance. For example, a score of 
3.6e-40 is 3.6 x lO" 40 . For up to ten species, the gene with the lowest P-value (and therefore the 
most likely homolog) is listed in Figure 3. 

20 In addition to P-values, comparisons were also scored by percentage identity. Percentage 

identity reflects the degree to which two segments of DNA or protein are identical over a 
particular length. The ranges of percent identity between the non-Arabidopsis genes shown in 
Figure 3 and the Arabidopsis genes in the sequence listing are: SEQ ID No. 1: 36%-69%; SEQ ID 
No. 3: 46%-54%; SEQ ID No. 5: 57%-72%; SEQ ID No. 7: 54%-69%; SEQ ID No. 9: 3 1%-68%; 

25 SEQ ID No. 1 1: 47%-90%; SEQ ID No. 13: 34%-82%; SEQ ID No. 15: 49%-88%; SEQ ID No. 
17: 56%-67%; SEQ ID No. 19: 39%-61%; SEQ ID No. 21 : 61%-87%; SEQ ID No. 23: 38%- 
85%; SEQ ID No. 25: 44%-94%; SEQ ID No. 27: 35%-44%; SEQ ID No. 29: 37%-71%; SEQ ID 
No. 31: 38%-77%; SEQ ID No. 33: 57%-69%; SEQ ID No. 35: 54%-69%; SEQ ID No. 37: 60%- 
75%; SEQ ID No. 39: 47%-65%; SEQ ID No. 41: 60%-88%; SEQ ID No. 43: 43%-87%; and 

30 SEQ ID No. 45 : 53%-97%. 

The polynucleotides and polypeptides in the Sequence Listing and the identified 
homologous sequences may be stored in a computer system and have associated or linked with 
the sequences a function, such as that the polynucleotides and polypeptides are useful for 
modifying the structure and development characteristics of a plant. 
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All references, publications, patents and other documents herein are incorporated by 
reference in their entirety for all purposes. Although the invention has been described with 
reference to the embodiments and examples above, it should be understood that various 
modifications can be made without departing from the spirit of the invention. 
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What is claimed is: 

1 . A transgenic plant with modified structure and development characteristics, which plant 
comprises a recombinant polynucleotide comprising a nucleotide sequence selected from the 
group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising a sequence selected from 
SEQ ID Nos. 2N, where N=l-23, or a complementary nucleotide sequence thereof; 

(b) a nucleotide sequence encoding a polypeptide comprising a conservatively substituted 
variant of a polypeptide of (a); 

(c) a nucleotide sequence comprising a sequence selected from those of SEQ ID Nos. 2N- 
l, where N=l-23, or a complementary nucleotide sequence thereof; 

(d) a nucleotide sequence comprising silent substitutions in a nucleotide sequence of (c); 

(e) a nucleotide sequence which hybridizes under stringent conditions to a nucleotide 
sequence of one or more of: (a), (b), (c), or (d); 

(f) a nucleotide sequence comprising at least 15 consecutive nucleotides of a sequence of 
anyof(aMe); 

(g) a nucleotide sequence comprising a subsequence or fragment of any of (a)-(f), which 
subsequence or fragment encodes a polypeptide that modifies a plant's structure and 
development characteristics; 

(h) a nucleotide sequence having $t least 31% sequence identity to a nucleotide sequence 
of any of (a)-(g); 

(i) a nucleotide sequence having at least 60% identity sequence identity to a nucleotide 
sequence of any of (a)-(g); 

(j) a nucleotide sequence which encodes a polypeptide having at least 31% identity 

sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; 

(k) a nucleotide sequence which encodes a polypeptide having at least 60% identity 

sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; and 

(1) a nucleotide sequence which encodes a polypeptide having at least 65% sequence 

identity to a conserved domain of a polypeptide of SEQ ID Nos. 2N, where N=l-23. 

2. The transgenic plant of claim 1 , further comprising a constitutive, inducible, or tissue- 
active promoter operably linked to said nucleotide sequence. 

3. The transgenic plant of claim 1 , wherein the plant is selected from the group consisting 
of: soybean, wheat, corn, potato, cotton, rice, oilseed rape, sunflower, alfalfa, sugarcane, turf, 
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banana, blackberry, blueberry, strawberry, raspberry, cantaloupe, carrot, cauliflower, coffee, 
cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, 
pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous fruits, and 
vegetable brassicas. 

5 

4. An isolated or recombinant polynucleotide comprising a nucleotide sequence selected 
from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising a sequence selected from 
SEQ ED Nos. 2N, where N=l-23, or a complementary nucleotide sequence thereof; 
10 (b) a nucleotide sequence encoding a polypeptide comprising a conservatively substituted 

variant of a polypeptide of (a); 

(c) a nucleotide sequence comprising a sequence selected from those of SEQ ID Nos. 2N- 
1, where N=l -23, or a complementary nucleotide sequence thereof; 

(d) a nucleotide sequence comprising silent substitutions in a nucleotide sequence of (c); 
15 (e) a nucleotide sequence which hybridizes under stringent conditions to a nucleotide 

sequence of one or more of: (a), (b), (c), or (d); 

(f) a nucleotide sequence comprising at least 15 consecutive nucleotides of a sequence of 
any of (a)-(e); 

(g) a nucleotide sequence comprising a subsequence or fragment of any of (a)-(f), which 
20 subsequence or fragment encodes a polypeptide that modifies a plant's structure and 

development characteristics; 

(h) a nucleotide sequence having at least 3 1% sequence identity to a nucleotide sequence 
of any of (a)-(g); 

(i) a nucleotide sequence having at least 60% identity sequence identity to a nucleotide 
25 sequence of any of (a)-(g); 

(j) a nucleotide sequence which encodes a polypeptide having at least 3 1% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l-23; 
(k) a nucleotide sequence which encodes a polypeptide having at least 60% identity 
sequence identity to a polypeptide of SEQ ID Nos. 2N, where N=l -23; and 
30 (1) a nucleotide sequence which encodes a conserved domain of a polypeptide having at 

least 65% sequence identity to a conserved domain of a polypeptide of SEQ ID Nos. 2N, 
where N=l-23. 
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inducible, or tissue-active promoter operably linked to the nucleotide sequence. 

6 . A cloning or expression vector comprising the isolated or recombinant polynucleotide of 
claim 4. 

7. . A cell comprising the cloning or expression vector of claim 6. 

8. A transgenic plant comprising the isolated or recombinant polynucleotide of claim 4. 

9 A composition produced by one or more of: 

(a) incubating one or more polynucleotide of claim4 with a nuclease; # 

(b) incubating one or more polynucleotide of claim 4 with a restriction enzyme; 

(c) incubating one or more polynucleotide of claim 4 with a polymerase; 

(d) incubating one or more polynucleotide of claim 4 with a polymerase and a pnmer; 

(e) incubating one or more polynucleotide of claim 4 with a cloning vector, or 

(f) incubating one or more polynucleotide of claim 4 with a cell. 

10. A composition comprising two or more different polynucleotides of claim 4. 

! , An isolated or recombinant polypeptide comprising a subsequence of at least about 15 
contiguous amino acids encoded by the recombinant or isolated polynucleotide of c.a.m 4. 

12. A plant ectopically expressing an isolated polypeptide of claim 1 1. 

1 3 A method for producing a plant having a modified structure and development 
characteristic, the method comprising altering the expression of the isolated or recombmant 
polynucleotide of claim 4 or the expression levels or activity of a polypeptide of clann lima 
plant, thereby producing a modified plant, and selecting the modified plant for modified structure 
) and development characteristics thereby providing the modified plant with a modified structure 
and development characteristics. 

14. The method of claim 13, wherein the polynucleotide is a polynucleotide of claim 4. 
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15. A method of identifying a factor that is modulated by or interacts with a polypeptide 
encoded by a polynucleotide of claim 4, the method comprising: 

(a) expressing a polypeptide encoded by the polynucleotide in a plant; and 

(b) identifying at least one factor that is modulated by or interacts with the polypeptide. 

5 

16. The method of claim 15, wherein the identifying is performed by detecting binding by the 
polypeptide to a promoter sequence, or detecting interactions between an additional protein and 
the polypeptide in a yeast two hybrid system. 

10 17. The method of claim 15, wherein the identifying is performed by detecting expression of 
a factor by hybridization to a microarray, subtractive hybridization or differential display. 

18. A method of identifying a molecule that modulates activity or expression of a 
polynucleotide or polypeptide of interest, the method comprising: 

1 5 (a) placing the molecule in contact with a plant comprising the polynucleotide or 

polypeptide encoded by the polynucleotide of claim 4; and, 
(b) monitoring one or more of: 

(i) expression level of the polynucleotide in the plant; 

(ii) expression level of the polypeptide in the plant; 

20 (iii) modulation of an activity of the polypeptide in the plant; or 

(iv) modulation of an activity of the polynucleotide in the plant. 

19. An integrated system, computer or computer readable medium comprising one or more 
character strings corresponding to a polynucleotide of claim 4, or to a polypeptide encoded by the 

25 polynucleotide. 

20. The integrated system, computer or computer readable medium of claim 1 9, further 
comprising a link between said one or more sequence strings to a modified plant structure and 
development characteristics phenotype. 

30 

21. A method of identifying a sequence similar or homologous to one or more 
polynucleotides of claim 4, or one or more polypeptides encoded by the polynucleotides, the 
method comprising: 

(a) providing a sequence database; and, 
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(b) querying the sequence database with one or more target sequences corresponding to 
the one or more polynucleotides or to the one or more polypeptides to identify one or 
more sequence members of the database that display sequence similarity or homology to 
one or more of the one or more target sequences. 



20 



22 



The method of claim 21, wherein the querying comprises aligning one or more of the 
target sequences with one or more of the one or more sequence members in the sequence 
database. 

10 23. The method of claim 21, wherein the querying comprises identifying one or more of the 
one or more sequence members of the database that meet a user-selected identity criteria with one 
or more of the target sequences. 

24. The method of claim 21, further comprising linking the one or more of the 

15 polynucleotides of claim 4, or encoded polypeptides, to a modified plant structure and 
development characteristics phenotype. 

25. A plant comprising altered expression levels of an isolated or recombinant polynucleotide 
of claim 4. 



26. A plant comprising altered expression levels or the activity of an isolated or recombinant 
polypeptide of claim 1 1 . 

27. A plant lacking a nucleotide sequence encoding a polypeptide of claim 1 1 . 



25 
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Figure 1 



SEQ ID No. 


GID 


cDNA or protein 


conserved domain 


1 


G727 


cDNA 




2 


G727 


protein 


226-269 


3 


G732 


cDNA 




4 


G732 


protein 


31-9 


5 


G9 


cDNA 




6 


G9 


protein 


62-127 


7 


G428 


cDNA 




8 


G428 


protein 


229-292 


g 


G869 


cDNA 




10 


G869 


protein 


109-177 


11 


G1269 


cDNA 




12 


G1269 


protein 


27-83 


13 


G1038 


cDNA 




14 


G1038 


protein 


198-247 


15 


G438 


cDNA j 




16 


G438 


nrotein 


22-85 


17 


G571 


cDNA 




18 


G571 


orotein 


160-220 


19 


G748 


cDNA 




20 


G748 


protein 


112-140 


21 


G431 


cDNA 




22 


G431 


protein 


286-335 


23 


G187 


cDNA 




24 


G187 


protein 


172-228 


25 


G470 


cDNA 




26 


G470 


protein 


61-393 


27 


G615 


cDNA 




28 


G615 


protein 


88-147 


29 


G1073 


cDNA 




30 


G1073 


protein 


33-42, 78-175 
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Figure 2 



SEQ ID No. 


GID 


homolog 


cDNA or protein 


conserved domain 


31 


G1493 


homolog of G727 


cDNA 




32 


G1493 


homolog of G727 


protein 


242-289 


33 


G993 


homolog of <39 


cDNA 




34 


G993 


homolog of G9 


protein 


69-134 


35 


G867 


homolog of G9 


cDNA 




36 


G867 


homolog of G9 


protein 


59-124 


37 


G1930 


homolog of G9 


cDNA 




i 38 


G1930 


homolog of G9 


protein 


59-124 


39 


G1594 


homolog of G428 


cDNA 




40 


G1594 


homolog of G428 


protein 


262-325 


41 


G391 


homolog of G438 


cDNA 




42 


G391 


homolog of G438 


protein 


25-85 


43 


G390 


homolog of G438 


cDNA 




44 


G390 


homolog of G438 


protein 


18-81 


45 


G1548 


homolog of G438 


cDNA 




46 


G1548 


homolog of G438 


protein 


17-77 
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Figure 3A 



SEQ ID No. 


GID 


GenDank nid 


n * »+\ I* ft 

r-vaiue 


opecies i 


1 


G727 


7283684 


2.20E-56 


Glycine max I 


1 


G727 


7206180 


8.40E-42 


Medicago truncatula 


1 


G727 


7614196 


2.20E-40 


Lotus japonicus 


1 


G727 


572293 


1.20E-31 


Oryza satlva 


1 


G727 


7218448 


7.70E-30 


Sorghum blcolor 




G727 


9291284 


1 .80E-27 


Lycopersicon hirsutum 




G727 


8901641 


5.10E-27 


Hordeum vulgare 




G727 


8380453 


6.60E-24 


Gossypium arboreum 


1 


G727 


9962201 


2.10E-12 


Cryptomeria japonica 


1 


G727 


8122498 


3.10E-08 


Lycopersicon esculentum 


3 


G732 


5048074 


5.60E-30 


Gossypium hirsutum 


3 


G732 


4384142 


6.10E-30 


Lycopersicon esculentum 


3 


G732 


7623218 


6.10E-30 


Gossypium arboreum 


3 


G732 


4457220 


1 .80E-29 


Capsicum chinense 


3 


G732 


7284989 


4.50E-28 


Glycine max 


3 


G732 


9650827 


1.20E-27 


Petroselinum crispum 


3 


G732 


7205618 


2.20E-26 


Medicago truncatula 


3 


G732 


3854258 


1.40E-22 


Populus tremula x Populus tremuloides 


5 


G9 


7643366 


6.80E-56 


Medicago truncatula 


5 


G9 


8669779 


4.20E-50 


Glycine max 


5 


G9 


8329389 


1.50E-48 


Mesembryanthemum crystallinum 


5 


G9 


9851335 


3.50E-42 


Sorghum bicolor 


5 


G9 


7412012 


1.50E-41 


Lycopersicon esculentum 


5 


G9 


10450225 


1.30E-38 


Solanum tuberosum 


5 


G9 


8902194 


8.30E-36 


Hordeum vulgare i 


5 


G9 


7722547 


2.60E-33 


Lotus japonicus 


5 


G9 


9696857 


1 .90E-32 


Triticum aestivum 


5 


G9 


7324245 


2.40E-32 


Lycopersicon pennellii 


7 


G428 


3327268 


5.50E-65 


Ipomoea nil 


7 


G428 


4589883 


1 .20E-60 


Nicotiana tabacum 


7 


G428 


1814233 


2.20E-56 


Solanum tuberosum 


7 


G428 


7581978 


8.50E-56 


Dendrobium grex Madame Thong-In 


7 


G428 


4098241 


1.50E-53 


Lycopersicon esculentum 


7 


G428 


4099825 


1 .30E-38 


Picea mariana 


7 


G428 


3462611 


2.50E-38 


Pisum sativum 


7 


G428 


3928842 


1 .90E-37 


Picea abies 


7 


G428 


9699343 


2.70E-35 


Triticum aestivum 


7 


G428 


1008878 


4.80E-35 


Zea mays 


9 


G869 


10235055 


1.00E-19 


Glycine max 


9 


G869 


2213784 


1.60E-19 


Lycopersicon esculentum 


9 


G869 


3065894 


9.20E-19 


Nicotiana tabacum 


9 


G869 


8570080 


5.30E-18 


Oryza sativa 


9 


G869 


7560260 


1.90E-17 


Medicago truncatula 


9 


G869 


9850452 


y.ijut:- ID 


oorgnum Dicoior 


9 


G869 


9963144 


1.10E-13 


Cryptomeria japonica 


9 


G869 


9660634 


1.90E-13 


Secale cereale 


9 


G869 


9362061 


3.40E-13 


Triticum aestivum 


9 


G869 


7788764 


7.20E-13 


Lotus japonicus 


11 


G1269 


9565366 


7.00E-37 


Glycine max 


11 


G1269 


5272360 


8.10E-37 


Lycopersicon esculentum 


11 


G1269 


9119112 


8.40E-28 


Medicago truncatula 


11 


G1269 


9852711 


2.10E-22 


Sorghum bicolor 
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Figure 3B 



SEQ ID No. 
11 i 
11 
11 
11 


GID 

G1269 
G1269 
G1269 
G1269 
G1269 


Genbank NID 
9255178 
10447957 
9435251 

OOCOA1A 

9696112 


P-value ]i 
1.10E-18 : 
8.60E-15 I 
1.20E-09 I 

3.80E-09 " 


Species 
lea mays 

Solanum tuberosum 
-lordeum vulgare 

3 opulus balsamifera subsp. trichocarpa 
rriticum aestivum 


11 
11 
13 
13 
13 
13 
13 
13 
13 


G1269 
G1038 
G1038 
G1038 
G1038 
G1038 
G1038 
G1038 


8213273 
8748344 

71QOCQ/I 

7Zoooo4 
7218448 
57229o 
8901641 
9962201 
7614196 


4.90E-09 < 
8.00E-37 

O.ZUC-OO ' 

8.80E-36 

O OAC OC 

4.30E-28 
2.20E-16 
6.50E-11 


Dryza sativa 
Wedicaqo truncatula 

f^lvrinp max 

Sorqhum bicolor 

nrvra sativa 

rtordeum vulgare 
Cryptomeria japonica 
| phiejaponicus 


13 
13 
13 

15 I 


G1038 
G1038 
G1038 
G438 


9291272 [ 
8122498 
9883662 
7209474 


0.00015 
n nnn^ 

0.68 
8.70E-204 


Lycopersicon hirsutum 
i urnnpr<;iron esculentum 
Triticum aestivum 
Oryza sativa 


15 
15 


G438 
G438 


7209911 
7571387 


2.20E-142 
2.30E-80 


Physcomitrella patens 
Medicago truncatula 


15 
15 
15 
15 
15 


G438 
G438 
G438 
G438 
G438 


8330425 
6531 152 
6726825 
5269007 
9253000 


3.00E-66 

a cnC CZA 

i .oUb-o** 
4.70E-61 
7.00E-54 
1.70E-47 


Mesembryanthemum crystallinum 
Glycine max 

Zea mays 

Solanum tuberosum 


15 
15 
17 
17 
17 
17 


G438 
G438 
G571 
G571 
G571 
G571 


8967371 
2963336 
6288681 
297019 
10423526 
5926681 


1.60E-34 
1.50E-70 
1.60E-68 
2.20E-61 
4.20E-61 


WnrHpum vnlnare 
Pinus taeda 
Nicotiana tabacum 
Zea mays 

Ory7» sativa 

Triticum afistivum 


17 
17 


G571 
G571 


4959969 
1372965 

*\/s/\o O O O 


1.90E-59 
1.20E-56 

a one AG. 


Lycopersicon esculentum 

V/iria faha 


17 
17 


G571 
G571 


8098832 
9566058 


1 .ZUb-40 
2.00E-43 


Glycine max 


17 


G571 


765198 


1.50E-41 
3.80E-41 


Solanum tuberosum 
Nicotiana sp. 


17 
19 
19 
19 

L 19 

19 
19 


G571 
G748 
G748 
G748 
G748 
G748 
G748 


19679 
853689 
7242897 
5888560 
! 6341666 
10700058 
7535776 


7.00E-87 

3.90E-59 

a one Ati 
1 .ZUk-40 

5.60E-38 

1.10E-36 

5.00E-33 


Cucurbita maxima 

Ory/a sativa 

1 w/^rtnorcirnn ASClltentum 

Glycine max 
Medicaqo truncatula 
Sorghum bicolor 


I 19 
19 

19 

19 

21 

21 


G748 
G748 
G748 
G748 
G431 
G431 


9419494 

f\AAf\A C7 
94lU10f 

3929324 
10449922 
7340349 
3462611 


2.10E-31 
a nnP-°ft 

4.30E-25 
2.30E-23 
9.90E-177 
1.20E-112 


Hordeum vulgare 

Triticum aestivum 

Dendrobium grex Madame Thong-IN 
" Solanum tuberosum 
Brassica oleracea 
Pisum sativum 


21 
21 
21 
21 

i 21 


G431 
G431 
G431 
G431 
G431 


310568 
2251078 
4098239 
1008878 
6942299 


1.50E-112 
1.90E-107 
1.20E-104 
4.90E-62 
7.90E-62 


Glycine max 
Nicotiana tabacum 
Lycopersicon esculentum 
Zea mays 
Triticum aestivum 
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Figure 3C 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


21 


G431 


3327239 


1.90E-61 


Oryza sativa 


21 


G431 


3928842 


1.60E-59 


Picea abies 


21 


G431 


2522483 


2.30E-59 


Hordeum vulgare 


23 


G187 


9304207 


2.10E-35 


Sorghum bicolor 


! 23 


G187 


9444636 


3.20E-34 


Triticum aestivum 


23 


G187 


5058292 


3.60E-34 


Glycine max 


23 


G187 


7721184 


2.40E-32 


Lotus japonicus 


23 


G187 


7562279 


1.20E-31 


Medicago truncatula I 


23 


G187 


8105974 


3.00E-29 


Lycopersicon esculentum 


23 


G187 


9049477 


1.60E-27 


Oryza sativa 


23 


G187 


9187621 


1.60E-23 


Solanum tuberosum 


23 


G187 


5268376 


5.60E-23 


Zea mays 


23 


G187 


4894964 


1.70E-22 


Avena sativa 


25 


G470 


6917173 


4.80E-78 


Lycopersicon pennellii 


25 


G470 


8827792 


8.50E-70 


Glycine max 


25 


G470 


5272309 


7.40E-69 


Lycopersicon esculentum 


25 


G470 


7563870 


6.70E-68 


Medicago truncatula \ 


25 


G470 


5296108 


5.50E-65 


Zea mays 


25 


G470 


7339690 


7.40E-57 


Oryza sativa 


25 


G470 


5047367 


1.30E-51 


Gossypium hirsutum 


25 


G470 


9856054 


9.70E-50 


Sorghum bicolor 


25 


G470 


3857884 


1.10E-38 


Populus balsamifera subsp. trichocarpa 


25 


G470 


8174666 


6.40E-37 


Hordeum vulgare 


27 


G615 


5566284 


2.00E-28 


Linaria vulgaris 


I 27 


G615 


6358617 


3.20E-27 


Antirrhinum graniticum 


27 


G615 


6358613 


1.40E-26 


Antirrhinum majus subsp. cirrhigerum 


27 


G615 


6358545 


8.60E-26 


Digitalis purpurea 


27 


G615 


6358538 


1 .40E-25 


Antirrhinum braun-blanquetii 


i 27 


G615 


6358541 


1.40E-25 


Misopates orontium 


I 27 


G615 


6358542 


1.40E-25 


Antirrhinum molle ! 


27 


G615 


6358573 


1.40E-25 


Misopates calycinum 


27 


G615 


6358546 


1.80E-25 


Antirrhinum siculum 


27 


G615 


2826867 


2.70E-25 


Antirrhinum majus ! 


! 29 


G1073 


7238733 


2.70E-55 


Medicago truncatula 


29 


G1073 


10843924 


1.50E-44 


Glycine max ! 


29 


G1073 


7615218 


2.00E-42 


Lotus japonicus 


29 


G1073 


7333102 


3.40E-34 


Lycopersicon esculentum 


29 


G1073 


9689692 


8.60E-28 


Pinus taeda 


I 29 


G1073 


9445090 


4.30E-25 


Triticum aestivum 


29 


G1073 


9252370 


2.80E-24 


Solanum tuberosum 


I 29 


G1073 


5042437 


5.80E-21 


Oryza sativa 


I 29 


G1073 


7536402 


6J0E-20 


Sorghum bicolor 


29 


G1073 


9662742 


2.70E-19 


Secale cereale 


I 31 


G1493 


7614196 


2.20E-50 


Lotus japonicus 


! 31 


G1493 


9986889 


6.10E-48 


Glycine max 


I 31 


G1493 


8748344 


2.20E-38 


Medicago truncatula 


! 31 


G1493 


572293 


1.70E-37 


Oryza sativa 


I 31 


G1493 


7218448 


5.70E-33 


Sorghum bicolor 


I 31 


G1493 


9291284 


9J0E-32 


Lycopersicon hirsutunrv 


31 


G1493 


8380453 


1.60E-30 


Gossypium arboreum 


I 31 


G1493 


8901641 


1J0E-30 


Hordeum vulgare 


! 31 


G1493 


9962201 


6.90E-17 


Cryptomeria japonica 


I 31 


G1493 


8122498 


1.50E-O8 


Lycopersicon esculentum 
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Figure 3D 



SEQ ID No. ! 
33 
33 
33 
33 


GID 

G993 
G993 
G993 
G993 


Genbank NID 
7643366 
8329389 
8669779 
9851335 


P-value < 
1.20E-58 I 
1 .oot-4y i 
6.10E-49 < 
6.30E-43 * 


Species 

vtedicago truncatula 

^ocomhrvanthpmi im rrv^taMinum 

Slycine max 
Sorqhum bicolor 


33 
33 
33 

33 i 

33 

33 


G993 
G993 
G993 
G993 
G993 
G993 


4384549 
10450225 
8902194 
7719409 
8749037 
9247126 


5.20E-40 
3.70E-39 
2.50E-34 
1.30E-32 
5.20E-32 
1.30E-30 
2.20E-57 


Lycopersicon esculentum 
Solanum tuberosum 
Hordeum vulgare 
Lotus japonicus 
Citrus x paradisi 
Oryza sativa 
Medicaqo truncatula 


35 
35 


G867 
G867 


7643366 
8329389 


1.10E-50 


Mesembryanthemum crystallinum 


35 


G867 


8669779 


2.70E-46 


Glycine max 


35 
35 


G867 
G867 


10450225 
9851335 


3.60E-41 
2.80E-40 


Solanum tuberosum 
Snrghum bicolor 


35 


G867 
G867 


9430646 
8902194 


7.20E-40 
1.60E-34 


Lycopersicon escuieniuiii 
Hordeum vulgare 


35 


G867 


7722547 


1.30E-33 


Lotus japonicus 


35 


G867 


7324245 


3.90E-32 


I ycnpersicon pennellii 


35 


G867 


8749037 


1.40E-31 


Citrus * paradisi 


37 


G1930 


7643366 


9.70E-57 


Medicago truncatula 




G1930 


8329389 


4.50E-47 


Mesembryanthemum crystallinum 


37 


G1930 


6069592 


1.10E-46 


Glycine max 


37 


G1930 


10450225 


6.50E-42 


Solanum tuberosum 
Lycopersicon esculentum 


37 
37 


G1930 
G1930 


9430646 
9851335 


8.20E-39 
1.80E-38 


Sorghum bicolor 


37 


G1930 


7722547 


4.70E-34 


Lotus japonicus 
Lycopersicon pennellii 


37 
37 


G1930 
G1930 


7324245 
8902194 


1.20E-32 
3.00E-31 


Hordeum vulgare 
Triticum aestivum 


37 

Of 

39 


G1930 
G1594 


9697984 
3327268 


4.60E-29 
2.60E-74 


Ipnmoea nil 


i 39 


G1594 


7581978 


9.20E-62 
1.50E-47 


Dendrobium qrex Madame Thong-In 
Oryza sativa 


39 
39 


G1594 
G1594 


4887609 
1814233 


4.00E-46 


Solanum tuberosum 


39 


G1594 


4589883 


6.30E-43 


Nicotiana tabacum 


39 


G1594 


4098241 


6.70E-43 


Lycopersicon esculentum 


39 


G1594 


3928842 


2.00E-42 
2.60E-42 


Picea abies 
Picea mariana 


39 
3Q 


G1594 
G1594 


4099825 
4240538 


1.70E-4t 


Zea mays 


3Q 


G1594 


1946219 


1.90E-41 


Malus domestica 


«t 1 


G391 


7209474 


4.70E-194 


Oryza sativa 


** 1 


G391 


720991 1 


2.10E-145 


Physcomitrella patens 




G391 


7560927 


8.70E-67 


Medicaqo truncatula 


41 


G391 


10808354 


1.50E-61 


Solanum tuberosum 


41 


G391 


5893826 


7.00E-60 


Lycopersicon esculentum 


41 


G391 


8330425 


8.60E-59 


Mesembryanthemum crystallinum 


41 


G391 


8284059 


8.70E-57 


Glycine max 


41 


G391 


5269007 


8.10E-46 


7m mays 


41 


G391 


9419425 


1.70E-43 


Hordeum vulgare 


41 


G391 


2963336 


2.10E-37 


Pinus taeda 


43 


G390 


7209474 


2.50E-166 


> Oryza sativa 


43 


G390 


720991 1 


1.70E-14S 


> Physcomitrella patens 


43 


G390 


7560927 


5.80E-81 


Medicago truncatula 
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Figure 3E 



SEQ ID No. 


GID 


Genbank NID 


P-value 


Species 


43 


G390 


7409018 


1.80E-68 


Lycopersicon esculentum 


43 


G390 


8071613 


3.00E-60 


Solanum tuberosum 


43 


G390 


9466042 


1.60E-59 


Hordeum vulgare 


43 


G390 


8284059 


1.00E-57 


Glycine max 


1 43 


G390 


8330425 


2.60E-44 


Mesembryanthemum crystallinum 


43 


G390 


5269007 


4.60E-44 


Zea mays 


\ 43 


G390 


2963336 


4.90E-43 


Pinus taeda 


! 45 


G1548 


7209474 


5.90E-169 


Oryza sativa 


45 


G1548 


720991 1 


3.30E-140 


Physcomilrella patens 


45 


G1548 


9253000 


1.60E-76 


Solanum tuberosum 


45 


G1548 


9820423 


1.40E-67 


Glycine max 


45 


G1548 


7570825 


8.40E-67 


Medicago truncatula 


45 


G1548 


9456848 


2.70E-55 


Lycopersicon esculentum 


45 


G1548 


9419425 


1.40E-47 


Hordeum vulgare 


45 


G1548 


6626571 


3.50E-46 


Zea mays 


45 


G1548 


8330425 


4.20E-46 


Mesembryanthemum crystallinum 


45 


G1548 


3853847 


2J0E-42 


Populus tremula x Populus tremuloides ! 
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MBI0018 Sequence Listing. ST25 
SEQUENCE LISTING 

<110> Riechmann, Jose Luis 
Reuber, Lynne 
Keddie, James 
Ratcliffe, Oliver 
Heard, Jacqueline 
Samaha, Raymond 
Yu, Guo- Liang 
Jiang, Cai-Zhong 

<120> Plant Developmental Genes 

<130> MBI-0018 

<150> 60/164,132 
<151> 1999-11-17 

<150> 60/197,899 

<151> 2000-04-17 

<150> Plant Trait Modification III 

<151> 2000-08-22 

<160> 46 

<170> Patentln version 3.0 

<210> 1 
<211> 2007 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (43) . . (1977) 

<223> G727 

ct?cttctcc ttctctgatc gttcgttttc tggacgagag ag atg gta aat ccg 54 

1 

aat cac qqa aga gga ccc gat teg ggt act get get ggt ggg tea aac 
Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala Gly Gly Ser Asn 
5 10 15 20 

tec gac ccg ttt ect gcg aat ctt cga gtt ctt gtc gtt gat gat gat 



ier Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val Val Asp Asp Asp 

25 30 35 

cca act tgt etc atg ate tta gag agg atg ctt atg act tgt etc tac 
Pro Thr Cys Leu Me? He Leu Glu Arg Met Leu Met Thr Cys Leu Tyr 
40 45 50 

aga gag cag aga gcg cat tgt etc tgc ttc gga aga aca aag aat ggt 
Arg Glu Gin Arg Ala His Cys Leu Cys Phe Gly Arg Thr Lys Asn Gly 
55 60 65 

ttt aat att gtc att agt gat gtt cat atg cct gac atg gat ggt ttc 
Phe Asp lie Val lie Ser Asp Val His Met Pro Asp Met Asp Gly Phe 
70 75 80 

aag etc ctt gaa cac gtt ggt tta gag atg gat tta cct gtt ate aat 
Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro val lie Asn 
85 90 ^5 100 

ctg aat gtt ttg aaa cct ttg gtt ata gtg atg tct gcg gat gat teg 
Leu Asn Val Leu Lys Pro Leu Val lie Val Met Ser Ala Asp Asp Ser 



102 



150 



105 HO H5 

aaa aqc qtt gtg ttg aaa gga gtg act cac ggt gca gtt gat tac etc 

Lys le* Val Val Leu Lys Gly Val Thr His Gly Ala Val Asp Tyr Leu 

120 125 130 



198 



246 



294 



342 



390 



438 



Page 1 



WO 01/36444 



PCT/US00/31325 



MBI0018 Sequence Listing. ST25 

ate aaa ccg gta cgt att gag get ttg aag aat ata tgg caa cat gtg 
lie Lys Pro Val Arg lie Glu Ala Leu Lys Asn lie Trp Gin His Val 
135 " 140 145 

gtg egg aag aag cgt aac gag tgg aat gtt tct gaa cat tct gga gga 
Val Arq Lys Lys Arg Asn Glu Trp Asn Val Ser Glu His Ser Gly Gly 
150 155 160 

agt att gaa gat act ggc ggt gac agg gac agg cag cag cag cat agg 
Ser lie Glu Asp Thr Gly Gly Asp Arg Asp Arg Gin Gin Gin His Arg 
165 " 170 175 180 

gag gat get gat aac aac teg tct tea gtt aat gaa ggg aac ggg agg 
Glu Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu Gly Asn Gly Arg 
185 190 195 

age teg agg aag egg aag gaa gag gaa gta gat gat caa ggg gat gat 
Ser Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp Gin Gly Asp Asp 
200 205 210 

aag gaa gac tea teg agt tta aag aaa cca cgc gtg gtt tgg tct gtt 
Lys Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val Val Trp Ser Val 
215 220 225 

gaa ttg cat cag cag ttt gtt get get gtg aat cag eta ggc gtt gac 
Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin Leu Gly Val Asp 
230 235 240 

agt gag tta aaa act tgc ttg ctt atg cat ttg tgt gtg teg att ggt 
Ser Glu Leu Lys Thr Cys Leu Leu Met His Leu Cys Val Ser lie Gly 
245 250 255 260 

aac att gtg gaa ttc cag aag tat egg ata tat ctg aga egg ctt gga 
Asn lie Val Glu Phe Gin Lys Tyr Arg He Tyr Leu Arg Arg Leu Gly 
265 270 275 

gga gta teg caa cac caa gga aat atg aac cat teg ttt atg act ggt 
Gly Val Ser Gin His Gin Gly Asn Met Asn His Ser Phe Met Thr Gly 
280 285 290 

caa gat cag agt ttt gga cct ctt tct teg ttg aat gga ttt gat ctt 
Gin Asp Gin Ser Phe Gly Pro Leu Ser Ser Leu Asn Gly Phe Asp Leu 
295 300 305 

caa tct tta get gtt act ggt cag etc cct cct cag age ctt gca cag 
Gin Ser Leu Ala Val Thr Gly Gin Leu Pro Pro Gin Ser Leu Ala Gin 
310 315 320 

ctt caa gca get ggt ctt ggc egg cct aca etc get aaa cca ggg atg 
Leu Gin Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala Lys Pro Gly Met 
325 330 335 340 

teg gtt tct ccc ctt gta gat cag aga age ate ttc aac ttt gaa aac 
Ser Val Ser Pro Leu Val Asp Gin Arg Ser He Phe Asn Phe Glu Asn 
345 350 355 

cca aaa ata aga ttt gga gac gga cat ggt cag acg atg aac aat gga 
Pro Lys He Arg Phe Gly Asp Gly His Gly Gin Thr Met Asn Asn Gly 
360 365 370 

aat ttg ctt cat ggt gtc cca acg ggt agt cac atg cgt ctg cgt cct 
Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met Arg Leu Arg Pro 
375. 380 385 

gga cag aat gtt cag age age gga atg atg ttg eca gta gca gac cag 
Gly Gin Asn Val Gin Ser Ser Gly Met Met Leu Pro Val Ala Asp Gin 
390 395 400 

eta cct cga gga gga cca teg atg eta cca tec etc ggg caa cag ccg 
Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu Gly Gin Gin Pro 
405 410 415 420 

ata ttg tea age age gtt tea aga aga age gat etc act ggt gcg ctg 
He Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu Thr Gly Ala Leu 
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534 



582 



630 



678 



726 



774 



822 



870 



918 



966 



1014 



1062 



1110 



1158 



1206 



1254 



1302 



1350 
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425 430 435 

acq qtt aga aac agt ate ccc gag acc aac age aga gtg tta cca act 1398 
Ala Val Arg Asn Ser He Pro Glu Thr Asn Ser Arg Val Leu Pro Thr 
440 445 450 

act cac teg gtc ttc aat aac ttc ccc gcg gat eta cct cgc age age 1446 
Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu Pro Arg Ser Ser 
455 460 465 

ttc ccg ttg gca agt gec cca ggg att tea gtt cca gta tea gtt tct 1494 
Phe Pro Leu Ala Ser Ala Pro Gly He Ser Val Pro Val Ser Val Ser 
470 475 480 



565 570 



tga ttattactcg aatacagtgc actctaaaac 

<210> 2 
<211> 644 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Val Asn Pro Gly His Gly Arg Gly Pro Asp Ser Gly Thr Ala Ala 
15 10 15 

Gly Gly Ser Asn Ser Asp Pro Phe Pro Ala Asn Leu Arg Val Leu Val 
20 25 30 

Val Asp Asp Asp Pro Thr Cys Leu Met He Leu Glu Arg Met Leu Met 
35 40 45 



1542 



1590 



1638 



tac caa gaa gag gtc aac age teg gat gca aaa gga ggt tea tea get 
Tyr Gin Glu Glu Val Asn Ser Ser Asp Ala Lys Gly Gly Ser Ser Ala 
485 490 495 500 

get act get gga ttt ggt aac cca age tac gac ata ttt aac gat ttt 
Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp He Phe Asn Asp Phe 
505 510 515 

ccg cag cac caa cag cac aac aag aac ate age aat aaa eta aac gat 
Pro Gin His Gin Gin His Asn Lys Asn He Ser Asn Lys Leu Asn Asp 
520 525 530 

tag gat ctg egg aat atg gga ttg gtc ttc agt tec aat cag gac gca 1686 
Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser Asn Gin Asp Ala 
535 540 545 

gca act gca acc gca acc gca gca ttt tec act teg gaa gca tac tct 1734 
Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser Glu Ala Tyr Ser 
550 555 560 

teg tct tct acg cag aga aaa aga egg gaa acg gac gca aca gtt gtg 1782 
Ser Ser Ser Thr Gin Arg Lys Arg Arg Glu Thr Asp Ala Thr Val Val 

m/\ C7C 580 



1830 



qqt gag cat ggg cag aac ctg cag tea ccg age egg aat ctg tat cat 
Gly Glu His Gly Gin Asn Leu Gin Ser Pro Ser Arg Asn Leu Tyr His 
1 585 590 595 

ctg aac cac gtt ttt atg gac ggt ggt tea gtc aga gtg aag tea gaa 1878 
Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg Val Lys Ser Glu 
600 605 610 

aga gtg gcg gag aca gtg act tgt cct cca gca aat aca ttg ttt cac 1926 
Arq val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn Thr Leu Phe His 
615 620 625 

gag cag tat aat caa gaa gat ctg atg age gca ttt etc aaa cag gtt 1974 
Glu Gin Tyr Asn Gin Glu Asp Leu Met Ser Ala Phe Leu Lys Gin Val 
630 635 640 



2007 
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Thr Cys Leu Tyr Arg Glu Gin Arg Ala His Cys Leu Cys Phe Gly Arg 
50 55 60 

Thr Lys Asn Gly Phe Asp He Val He Ser Asp Val His Met Pro Asp 
65 70 75 80 

Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu 
85 90 95 

Pro Val lie Asn Leu Asn Val Leu Lys Pro Leu Val He Val Met Ser 
100 105 HO 

Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val Thr His Gly Ala 
115 120 125 

Val Asp Tyr Leu He Lys Pro Val Arg He Glu Ala Leu Lys Asn He 
130 135 140 

Trp Gin His Val Val Arg Lys Lys Arg Asn Glu Trp Asn Val Ser Glu 
145 150 155 160 

His Ser Gly Gly Ser He Glu Asp Thr Gly Gly Asp Arg Asp Arg Gin 
165 * 170 175 

Gin Gin His Arg Glu Asp Ala Asp Asn Asn Ser Ser Ser Val Asn Glu 
180 185 190 

Gly Asn Gly Arg Ser Ser Arg Lys Arg Lys Glu Glu Glu Val Asp Asp 
195 200 205 

Gin Gly Asp Asp Lys Glu Asp Ser Ser Ser Leu Lys Lys Pro Arg Val 
210 215 220 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
225 230 235 240 

Leu Gly Val Asp Ser Glu Leu Lys Thr Cys Leu Leu Met His Leu Cys 
245 " 250 255 

Val Ser lie Gly Asn He Val Glu Phe Gin Lys Tyr Arg He Tyr Leu 
260 265 270 

Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn Met Asn His Ser 
275 280 285 

Phe Met Thr Gly Gin Asp Gin Ser Phe Gly Pro Leu Ser Ser Leu Asn 
290 295 . 300 

Gly Phe Asp Leu Gin Ser Leu Ala Val Thr Gly Gin Leu Pro Pro Gin 
305 310 315 320 

Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg Pro Thr Leu Ala 
325 330 335 . 



Lys Pro Gly Met Ser Val Ser Pro Leu Val Asp Gin Arg Ser He Phe 
340 345 350 
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Asn Phe Glu Asn Pro. Lys He Arg Phe Gly Asp Gly His Gly Gin Thr 
355 360 365 

Met Asn Asn Gly Asn Leu Leu His Gly Val Pro Thr Gly Ser His Met 
370 375 380 

Ara Leu Arq Pro Gly Gin Asn Val Gin Ser Ser Gly Met Met Leu Pro 
385 390 395 400 

Val Ala Asp Gin Leu Pro Arg Gly Gly Pro Ser Met Leu Pro Ser Leu 
405 410 415 



Glv Gin Gin Pro He Leu Ser Ser Ser Val Ser Arg Arg Ser Asp Leu 
420 425 430 

Thr Gly Ala Leu Ala Val Arg Asn Ser He Pro Glu Thr Asn Ser Arg 
435 440 445 

val Leu Pro Thr Thr His Ser Val Phe Asn Asn Phe Pro Ala Asp Leu 
450 455 460 



Pro Arg Ser Ser Phe Pro Leu Ala Ser Ala Pro Gly lie Ser Val Pro 



465 



470 



475 



Val Ser Val Ser Tyr Gin Glu Glu Val Asn Ser Ser Asp Ala Lys Gly 
485 490 495 

Glv Ser Ser Ala Ala Thr Ala Gly Phe Gly Asn Pro Ser Tyr Asp He 
1 500 505 510 

Phe Asn Asp Phe Pro Gin His Gin Gin His Asn Lys Asn He Ser Asn 
515 520 525 

Lys Leu Asn Asp Trp Asp Leu Arg Asn Met Gly Leu Val Phe Ser Ser 
530 535 . 540 



Asn Gin Asp Ala Ala Thr Ala Thr Ala Thr Ala Ala Phe Ser Thr Ser 



545 



550 



555 



Glu Ala Tyr Ser Ser Ser Ser Thr Gin Arg Lys Arg Arg Glu Thr Asp 



565 



570 



Ala Thr Val Val Gly Glu His Gly Gin Asn Leu Gin Ser Pro Ser Arg 
580 585 590 



Asn Leu Tyr His Leu Asn His Val Phe Met Asp Gly Gly Ser Val Arg 
595 600 605 

val Lys Ser Glu Arg Val Ala Glu Thr Val Thr Cys Pro Pro Ala Asn 
610 615 620 



Thr Leu Phe His Glu Gin Tyr Asn Gin Glu Asp Leu Met Ser Ala Phe 
625 630 635 640 



Leu Lys Gin Val 



Page 5 



WO 01/36444 



PCT/US00/31325 



MBI0018 Sequence Listing. ST25 



<210> 3 

<211> 834 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (73).. (588) 

<223> G732 



<400> 3 

aaaaaaacca aacataaaac ataaaactct gtcctttttt tgtcttcttg taacttttct 60 

tgttaaaaat ca atg gcg tea tct age age aca tac egg age tea age tct 111 
Met Ala Ser Ser Ser Ser Thr Tyr Arg Ser Ser Ser Ser 
15 10 

tec gac ggt ggt aat aat aac ccg teg gac tec gtc gtc acc gtc gac 159 
Ser Asp Gly Gly Asn Asn Asn Pro Ser Asp Ser Val Val Thr Val Asp 
15 20 25 

gaa cga aaa cgt aaa aga atg tta teg aac aga gaa tct gca cgt agg 207 
Glu Arg Lys Arg Lys Arg Met Leu Ser Asn Arg Glu Ser Ala Arg Arg 
30 ' 35 40 45 

tea agg atg cgt aaa cag aaa cac gtt gat gat eta acg get cag ate 255 
Ser Arg Met Arg Lys Gin Lys His Val Asp Asp Leu Thr Ala Gin He 
50 55 60 

aat cag eta tea aac gac aac cgt cag ate ttg aac age etc acc gta 303 
Asn Gin Leu Ser Asn Asp Asn Arg Gin He Leu Asn Ser Leu Thr Val 
65 70 75 

aca tct cag ctt tac atg aag ate caa gee gag aac tct gtt etc acc 351 
Thr Ser Gin Leu Tyr Met Lys He Gin Ala Glu Asn Ser Val Leu Thr 
80 85 90 

get cag atg gag gag ctt age acc aga etc caa tct etc aac gag ate 399 
Ala Gin Met Glu Glu Leu Ser Thr Arg Leu Gin Ser Leu Asn Glu He 
95 100 105 

gtt gat ctt gtt caa tec aac ggt gca gga ttt ggt gtt gac cag ate 447 
Val Asp Leu Val Gin Ser Asn Gly Ala Gly Phe Gly Val Asp Gin He 
110 115 120 125 

gac ggc tgt ggt ttt gat gat cgt acg gtt ggg ate gac gga tat tac 495 
Asp Gly Cys Gly Phe Asp Asp Arg Thr Val Gly He Asp Gly Tyr Tyr 
130 135 140 

gat gat atg aat atg atg agt aat gtt aat cat tgg ggt ggt teg gtt 543 
Asp Asp Met Asn Met Met Ser Asn Val Asn His Trp Gly Gly Ser Val 
145 150 155 

tac act aac caa ccc att atg get aat gat ate aat atg tat tga 588 
Tyr Thr Asn Gin Pro He Met Ala Asn Asp He Asn Met Tyr 
160 165 170 



ttaataaaat 


taattaaaat 


aattagatgc 


cccttttttg 


tctttttatt 


ttaaaattta 


648 


geccattttg 


gtgtttttgg gttggtgtga 


tgatgtaatt 


atagtacatg 


catctttgat 


708 


tggttggaag 


gataaatata 


aactttatat 


atatattggg 


gcatatatat 


atgagttgta 


768 


etttgeatgt 


attggtgtgt 


gttttgttat 


aattatatga 


ttatatatgt 


ttatgttaaa 


828 


aaaaaa 












834 


<210> 4 
<211> 171 
<212> PRT 
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<213> Arabidopsis thaliana 



<400> 4 

Met Ala Ser Ser Ser Ser Thr Tyr Ar 9 Ser Ser Ser Ser Ser Asp Gly 
1 5 10 10 



Gly Asn Asn Asn Pro Ser Asp Ser Val Val Thr Val Asp Glu Arg Lys 



20 



Arg Lys Arg Met Leu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg Met 
35 40 45 

Arg Lys Gin Lys His Val Asp Asp Leu Thr Ala Gin He Asn Gin Leu 
50 55 

Ser Asn Asp Asn Arg Gin lie Leu Asn Ser Leu Thr Val Thr Ser Gin 
65 7 ° 75 

Leu Tyr Met Lys lie Gin Ala Glu Asn Ser Val Leu Thr Ala Gin Met 
85 90 

Glu Glu Leu Ser Thr Arg Leu Gin Ser Leu Asn Glu lie Val Asp Leu 
100 105 110 

val Gin Ser Asn Gly Ala Gly Phe Gly Val Asp Gin lie Asp Gly Cys 
115 120 " b 

Gly Phe Asp Asp Arg Thr Vai Gly He Asp Gly Tyr Tyr Asp Asp Met 
130 1 35 X40 

Asn Met Met Ser Asn Val Asn His Trp Gly Gly Ser Val Tyr Thr Asn 
145 150 155 

Gin Pro He Met Ala Asn Asp He Asn Met Tyr 
165 170 



<210> 5 

<211> 1246 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (81) . . (H39) 

<223> G9 



g^tttcttc tttctgctaa aaggttataa tttttgtttc ttggtttggt gagaatcttc 

aagaaactga aacaaagaaa atg gat tct agt tgc ata gac gag ata agt tec 
* 3 Met Asp Ser Ser Cys He Asp Glu lie ser &er 



1 5 

tec act tea qaa tct ttc tec gee ace acc gee aag aag etc tct cct 
Ser Thr Ser ctu Ser Phe Ser Ala Thr Thr Ala Lys Lys Leu Ser Pro 
15 20 25 

cct ccc qcg gcg gcg tta cgc etc tac egg atg gga age ggc ggg age 
Pro Pro III III 111 Leu Arg Leu Tyr Arg Met Gly Ser Gly Gly Ser 

30 35 
age gtc gtg ttg gat ccc gag aac ggc eta gag acg gag tea cga aag 
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Ser Val Val Leu Asp Pro Glu Asn Gly Leu Glu Thr Glu Ser Arg Lys 
45 50 55 

eta cca tct tea aaa tac aaa ggt gtt gtt cct cag cct aac gga aga 305 
Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro Asn Gly Arg 
60 65 70 75 

tgg gga get cag ate tac gag aag cac caa cga gta tgg etc ggg act 353 
Trp Gly Ala Gin He Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr 
80 85 90 

ttc aac gag caa gaa gaa get get cgt tec tac gac ate gca get tgt 401 
Phe Asn Glu Gin Glu Glu Ala Ala Arg Ser Tyr Asp He Ala Ala Cys 
95 100 105 

aga ttc cgt ggc cgc gac gee gtc gtc aac ttc aag aac gtt ctg gaa 449 
Arg Phe Arg Gly Arg Asp Ala Val Val Asn Phe Lys Asn Val Leu Glu 
110 115 120 

gac ggc gat tta get ttt ctt gaa get cac tea aag gee gag ate gtc 497 
Asp Gly Asp Leu Ala Phe Leu Glu Ala His Ser Lys Ala Glu He Val 
125 130 135 

gac atg ttg aga aaa cac act tac gee gac gag ctt gaa cag aac aat 545 
Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Leu Glu Gin Asn Asn 
140 145 150 155 

aaa egg cag ttg ttt etc tec gtc gac get aac gga aaa cgt aac gga 593 
Lys Arg Gin Leu Phe Leu Ser Val Asp Ala Asn Gly Lys Arg Asn Gly 
160 165 170 

teg agt act act caa aac gac aaa gtt tta aag acg tgt gaa gtt ctt 641 
Ser Ser Thr Thr Gin Asn Asp Lys Val Leu Lys Thr Cys Glu Val Leu 
175 180 185 

ttc gag aag get gtt aca cct age gac gtt ggg aag eta aac cgt etc 689 
Phe Glu Lys Ala Val Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu 
190 195 200 

gtg ata cct aaa caa cac gee gag aaa cac ttt ccg tta ccg tea ccg 737 
Val He Pro Lys Gin His Ala Glu Lys His Phe Pro Leu Pro Ser Pro 
205 210 215 

tea ccg gca gtg act aaa gga gtt ttg ate aac ttc gaa gac gtt aac 785 
Ser Pro Ala Val Thr Lys Gly Val Leu lie Asn Phe Glu Asp Val Asn 
220 225 ~ 230 235 

ggt aaa gtg tgg agg ttc cgt tac tea tac tgg aac agt agt caa agt 833 
Gly Lys Val Trp Arg Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser 
240 245 250 

tac gtg ttg ace aag gga tgg agt cga ttc gtc aag gag aag aat ctt 881 
Tyr Val Leu Thr Lys Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu 
255 260 265 

cga gee ggt gat gtt gtt act ttc gag aga teg acc gga eta gag egg 929 
Arg Ala Gly Asp Val Val Thr Phe Glu Arg Ser Thr Gly Leu Glu Arg 
270 275 280 

cag tta tat att gat tgg aaa gtt egg tct ggt ccg aga gaa aac ccg 977 
Gin Leu Tyr He Asp Trp Lys Val Arg Ser Gly Pro Arg Glu Asn Pro 
285 290 295 

gtt cag gtg gtg gtt egg ctt ttc gga gtt gat ate ttt aat gtg acc 1025 
Val Gin Val Val Val Arg Lieu Phe Gly Val Asp He Phe Asn Val Thr 
300 305 310 315 

acc gtg aag cca aac gac gtc gtg gec gtt tgc ggt gga aag aga tct 1073 
Thr Val Lys Pro Asn Asp Val Val Ala Val Cys Gly Gly Lys Arg Ser 
320 325 330 

cga gat gtt gat gat atg ttt gcg tta egg tgt tec aag aag cag gcg 1121 
Arg Asp Val Asp Asp Met Phe Ala Leu Arg Cys Ser Lys Lys Gin Ala 
335 . 340 345 
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ata ate aat get ttg tga catatttcct tttccgattt tatgettteg 
He He Asn Ala Leu 
350 

ttttttaatt tttttttttg tcaagttgtg taggttgtga ttcatgetag gttgtattta 1229 

1246 

ggaaaagaga taagacc 



<210> 6 

<211> 352 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 6 

Met Asp Ser Ser Cys He Asp Glu He Ser Ser Ser Thr Ser Glu Ser 
1 5 



Phe Ser Ala Thr Thr Ala Lys Lys Leu Ser Pro Pro Pro Ala Ala Ala 
20 25 

Arg Leu Tyr Arg Met Gly Ser Gly Gly Ser Ser Val Val Leu Asp 

40 



Leu 



35 



Pro Glu Asn Gly Leu Glu Thr Glu Ser Arg Lys Leu Pro Ser Ser Lys 



50 



55 



Tyr Lys Gly Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin lie 
65 * 70 75 

Tyr Glu Lys His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Gin Glu 
8 5 

Glu Ala Ala Arg Ser Tyr Asp He Ala Ala Cys Arg Phe Arg Gly Arg 

100 105 



Asp Ala val Val Asn Phe Lys Asn Val Leu Glu Asp Gly Asp Leu Ala 

Phe Leu Glu Ala His Ser Lys Ala Glu lie Val Asp Met Leu Arg Lys 

His Thr Tyr Ala Asp Glu Leu Glu Gin Asn Asn Lys Arg Gin Leu Phe 
145 "0 155 

L eu ser Val Asp Ala Asn Gly Lys Arg Asn Gly Ser Ser Thr Thr Gin 



AS n Asp Lye Val Leu Lys Thr Cys Glu Val Leu Phe Glu Lys Ala Val 

He 

200 205 



180 185 
Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val lie Pro Lys Gin 



195 



His Ala Glu Lys His Phe Pro Leu Pro Ser Pro Ser Pro Ala Val Thr 
210 21S 220 

Gly Val Leu He Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg 



210 

Lys Gly val ueu uo «=" ,,, 
225 230 * i:> 
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Phe Arg Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys 

245 250 255 

Gly Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val 

260 265 270 

Val Thr Phe Glu Arg Ser Thr Gly Leu Glu Arg Gin Leu Tyr lie Asp 

275 280 285 

Trp Lys Val Arg Ser Gly Pro Arg Glu Asn Pro Val Gin Val Val Val 

290 " 295 300 

Arg Leu Phe Gly Val Asp lie Phe Asn Val Thr Thr Val Lys Pro Asn 

305 310 315 320 

Asp Val Val Ala Val Cys Gly Gly Lys Arg Ser Arg Asp Val Asp Asp 

325 330 335 

Met Phe Ala Leu Arg Cys Ser Lys Lys Gin Ala He He Asn Ala Leu 

340 345 350 



<210> 


7 


<211> 


1379 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(97) . . (1032) 


<223> 


G428 


<400> 


7 



ttacttttgt gtttcttcat attcttcaga agcaagcaca aggctaggga tcgaagaagc 60 

ggcgatcact gatcgtatct cactacgatc acatta atg gat aga atg tgt ggt 114 

Met Asp Arg Met Cys Gly 
1 5 



ttc cgc teg acg gaa gac tat teg gag aaa gcg acg ttg atg atg ccg 
Phe Arg Ser Thr Glu Asp Tyr Ser Glu Lys Ala Thr Leu Met Met Pro 
10 15 20 



ctg ttt gga tec gac gaa etc get ace get ttg tec teg gag ttg ctt 

Leu Phe Gly Ser Asp Glu Leu Ala Thr Ala Leu Ser Ser Glu Leu Leu 

40 ... 45 50 

ccg cgt att cga aaa get gag gat aat ttc tct ctt agt gtc ate aaa 

Pro Arg He Arg Lys Ala Glu Asp Asn Phe Ser Leu Ser Val He Lys 
55 60 65 70 



ate gat tgc caa aag gtg gga gcg cct atg gaa ata gcg tgt ata ttg 
He Asp Cys Gin LyB Val Gly Ala Pro Met Glu He Ala Cys He Leu 
90 95 100 



162 



tec gat tat cag tct ttg att tgt tea acc acc gga gac aat caa aga 210 
Ser Asp Tyr Gin Ser Leu He Cys Ser Thr Thr Gly Asp Asn Gin Arg 
25 30 35 



258 



306 



tec aaa ate get tct cat cct ttg tat cct cgc tta etc caa acc tac 354 
Ser Lys He Ala Ser His Pro Leu Tyr Pro Arg Leu Leu Gin Thr Tyr 
75 80 85 



402 



gaa gag att cag cga gag aac cat gtg tac aag aga gat gtt get cca 4 50 

Glu Glu He Gin Arg Glu Asn His Val Tyr Lys Arg Asp Val Ala Pro 
105 HO 115 
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fta tct tac ttt qga get gat cct gag ctt gat gaa ttc atg gaa acc 
£u Ser Pne Sy La Asp Pro Glu Leu Asp Glu Phe Met Glu Thr 
120 125 130 

♦-or. oat- ata ttq qtt aaa tac aaa acc gat ctt gcg agg ccg ttc 
Sr §8 lap Ue Leu wll Lys Tyr Lys Thr Asp Leu Ala Arg Pro Phe 
135 140 

b iii k s s a a a a s a a a s a a 

155 160 

, M tac act qqt cca gcg tct get aca get ctt tea gat gat ggt gcg 
HI Cys Tnr % Pro Ala Ser Ala Thr Ala Leu Ser Asp Asp Gly Ala 



155 160 

cca gcg tct get aca 
Pro Ala Ser Ala Thr 
170 175 

att tea tct qac gag gaa ctg aga gaa gat gat gac ata gca gcg gat 
fal Ser Ser Asp Glu Glu Leu Arg Glu Asp Asp Asp lie Ala Ala Asp 
185 I 90 195 

gac age caa caa aga age aat gac cgc gat ctg aag gac cag eta eta 
Asp Sir Gin Gin Arg Ser Asn Asp Arg Asp Leu Lys Asp Gin Leu Leu 
* 200 205 210 



3i a a © s s s 85 a a a a b a a a 

215 220 225 

IS 5S S S i 5! S - S K K SI S K 3 S 

e a a a e s a 5 s - a a a js is; is 

250 255 

a a a a s e a a a a e a a a a a 

265 270 

_ ♦-♦-i- ara sac caa aqq aaa cqc cat tgg aag cct teg gag aac 

SS T 9 4 Phe ?" Asn Sin lr| Lys A?g His Trp Lys Pro Ser Glu Asn 
280 285 

a a a tip a a s a a a a a a a a as 

295 30° 305 

gaa tga aaagagagac atgggattgt gcattgtata atttttacac tgttttccca 



Glu 



agaaaagaaa acagtaaaaa gcttttggta aatgggacat catcgegaat gaatggaaec 
agttagccaa aacggtcaag ggcgtggcgt aacgagacat tgtattggaa atagtggcaa 
tattatgtca ctaatcttcc aatggtccaa aatgatagat ttcttatttg tattgaacct 
tacttagata gctgatgtgt caactaaata atttattttc atccttatac tacttgtatc 
aatgtctcta attgatcaat tgttgcttgc tattcaaaaa aaaaaaaaaa aaaaaaa 

<210> 8 
<211> 311 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Asp Arg Met Cys Gly Phe Arg Ser Thr Glu Asp Tyr Ser Glu Lys 
1 5 1° 

Ala Thr Leu Met Met Pro Ser Asp Tyr Gin Ser Leu He Cys Ser Thr 
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498 

546 

594 

642 

690 

738 

786 
834 
882 
930 
978 

1026 

1082 

1142 
1202 
1262 
1322 
1379 
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20 25 30 

Thr Gly Asp Asn Gin Arg Leu Phe Gly Ser Asp Glu Leu Ala Thr Ala 
35 " 40 45 

Leu Ser Ser Glu Leu Leu Pro Arg lie Arg Lys Ala Glu Asp Asn Phe 
50 55 60 

Ser Leu Ser Val He Lys Ser Lys He Ala Ser His Pro Leu Tyr Pro 
65 70 75 80 

Arg Leu Leu Gin Thr Tyr lie Asp Cys Gin Lys Val Gly Ala Pro Met 
85 90 95 

Glu lie Ala Cys He Leu Glu Glu He Gin Arg Glu Asn His Val Tyr 
100 105 HO 

Lys Arg Asp Val Ala Pro Leu Ser Cys Phe Gly Ala Asp Pro Glu Leu 
115 120 125 

Asp Glu Phe Met Glu Thr Tyr Cys Asp He Leu Val Lys Tyr Lys Thr 
130 135 140 

Asp Leu Ala Arg Pro Phe Asp Glu Ala Thr Thr Phe He Asn Lys He 
145 150 155 160 

Glu Met Gin Leu Gin Asn Leu Cys Thr Gly Pro Ala Ser Ala Thr Ala 
165 170 175 

Leu Ser Asp Asp Gly Ala Val Ser Ser Asp Glu Glu Leu Arg Glu Asp 
180 185 190 

Asp Asp lie Ala Ala Asp Asp Ser Gin Gin Arg Ser Asn Asp Arg Asp 
195 200 205 

Leu Lys Asp Gin Leu Leu Arg Lys Phe Gly Ser His He Ser Ser Leu 
210 215 220 

Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg Glu 
225 230 235 240 

Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Val His Asn Lys Trp Pro 
245 250 255 

Tyr Pro Thr Glu Gly Asp Lys He Ala Leu Ala Glu Glu Thr Gly Leu 
260 265 270 

Asp Gin Lys Gin He Asn Asn Trp Phe He Asn Gin Arg Lys Arg His 
275 280 285 

Trp Lys Pro Ser Glu Asn Met Pro Phe Asp Met Met Asp Asp Ser Asn 
290 295 300 

Glu Thr Phe Phe Thr Glu Glu 
305 310 
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<210> 9 
<211> 1571 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (428) . . (1402) 

<223> G869 

agjaacaltg aaaggttcgg ttttttgggt ttcgatctga taatcaacaa gaaaaaaggg 60 

tttgatttat gtcggctggg tttgaatcga ctgtgatttt gtctttgatt catatctctt 120 

ctccgatttc atcatcatct tccccatcat cgtcgtcttt gaaatcttgt cttctcaacg 

ctcttcactt ctgctgtaat aagcagaggc ttgttctgga gactccttct ctttccatgc 

gcttaagacc caaaaggact tgttctagtg ttgaagtctt tgggggtttt cacataaagc 

agcaaaagtt ttcttttttc atagttcgct gagagttttg agttttgata ccaaaaaagt 

tttgaccttt tagagtgatt ttttgttctt tctgttttct gggtattttt gaggagtggg 

tttaaca atq qtt qcg att aga aag gaa cag tct ttg agt ggt gtt agt 
tttaaca atg gtt gcg ^ ^ ^ ^ ^ ger Leu ger Qly Val Ser 



1 



5 10 



age gag att aag aag aga get aag aga aac act eta teg tec ctt cct 
sir III He Lyl Lys Arg Ala Lys Arg Asn Thr Leu Ser Ser Leu Pro 
15 20 25 30 

caa qaa acc caa cct ttg agg aaa gtc cgt att att gtg aat gat cct 
G?n gIu Thr tin Pro Leu Jllg Lys val Arg He lie Val Asn Asp Pro 

40 45 



tat get act gat gat tec tct agt gat gag gaa gag ctt aag gtt cct 
Tyr Ala Thr Asp Asp Ser Ser Ser Asp Glu Glu Glu Leu Lys Val Pro 
50 55 60 

aaq cca agg aaa atg aaa cgt ate gtt cgt gag att aac ttt cct tct 
Lyl Pro S Lys Me! Lys Arg lie Val Arg Glu lie Asn Phe Pro Ser 
65 70 75 

ato qaa qtt tct gaa cag cct tct gag agt tct tct cag gac agt act 
Me? Ill val Ser Glu Gin Pro Ser Glu Ser Ser Ser Gin Asp Ser Thr 
80 85 90 

aaa act oat qqc aag ata get gtg tea get tct cct get gtt cct agg 
Lys Thr Asp Gly Lys He Ala Val Ser Ala Ser Pro Ala Val Pro Arg 
95 100 105 110 

aaa aaa cct qtt ggt gtt agg caa agg aaa tgg ggg aaa tgg get get 
Lyl Lyl Pro Val Gly Val Ar5 Gin Arg Lys Trp Gly Lys Trp Ala Ala 
1 ' xi5 120 15 

gag att aga gat cct att aag aaa act agg act tgg ttg ggt act ttt 
Glu lie Arg Asp Pro lie Lys Lys Thr Arg Thr Trp Leu Gly Thr Phe 
130 135 

aat act ctt gaa gaa get get aaa get tat gat get aag aag ctt gag 
Asp Thr Leu Glu Glu Ala Ala Lys Ala Tyr Asp Ala Lys Lys Leu Glu 
145 150 155 

ttt cat qct att qtt get gga aat gtg tec act act aaa cgt gat gtt 
Phe Asp Ala Ue Val Ala Gly Asn Val Ser Thr Thr Lys Arg Asp Val 
160 165 170 

tct tea tct qag act age caa tgc tct cgt tct tea cct gtt gtt cct 
Ser Ser Ser Gil Thr sir Gin C?s Ser Arg Ser Ser Pro Val Val Pro 
175 180 185 

gtt gag caa gat gac act tct gca tea get etc act tgt gtc aac aac 
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180 
240 
300 
360 
420 
469 



517 



565 



613 



661 



709 



757 



805 



853 



901 



949 



997 



1045 
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val 


Glu 


Gin 


Asp Asp. Thr 
195 


Ser 


Ala 


Ser 


Ala 
200 


Leu 


Thr 


Cys 


Val 


Asn Asn 
205 




cct 
Pro 


gat 
Asp 


gac 
Asp 


gtc teg acc 
Val Ser Thr 
210 


gtt 
Val 


get 
Ala 


cca 
Pro 
215 


act 
Thr 


get 
Ala 


cca 
Pro 


act 
Thr 


cca 
Pro 
220 


aat gtt 
Asn Val 


1093 


cct 
Pro 


get 
Ala 


ggt 
Gly 
225 


gga aac aag 
Gly Asn Lys 


gaa 
Glu 


acg 
Thr 
230 


ttg 
Leu 


ttc 
Phe 


gat 
Asp 


ttc 
Phe 


gac 
Asp 
235 


ttt 
Phe 


act aat 
Thr Asn 


1141 


eta 
Leu 


cag 
Gin 
240 


ate 
He 


cct gat ttt 
Pro Asp Phe 


ggt 
Gly 
245 


ttc 
Phe 


ttg 
Leu 


gca 
Ala 


gag 
Glu 


gag 
Glu 
250 


caa 
Gin 


caa 
Gin 


gac eta 
Asp Leu 


1189 


gac 
Asp 
255 


ttc 
Phe 


gat 
Asp 


tgt ttc etc 
Cys Phe Leu 
260 


gcg 
Ala 


gat 
Asp 


gat 

Asp 


cag 
Gin 


ttt 
Phe 
265 


gat 
Asp 


gat 
Asp 


ttc 
Phe 


ggc ttg 
Gly Leu 

270 


1237 


ctt 
Leu 


gat 
Asp 


gac 
Asp 


att caa gga 
He Gin Gly 
275 


ttc 
Phe 


qaa 
Glu 


gat 
Asp 


aac 
Asn 
280 


ggt 
Gly 


cca 
Pro 


agt 
Ser 


gcg 
Ala 


tta cca 
Leu Pro 
285 


1285 


gat 

Asp 


ttc 
Phe 


gac 

ASp 


ttt gcg gat 
Phe Ala Asp 
290 


gtt 
Val 


gaa 
Glu 


gat 
Asp 
295 


Ctt 
Leu 


cag 
Gin 


eta 
Leu 


get 
Ala 


gac 
Asp 
300 


tct agt 
Ser Ser 


1333 


ttc 
Phe 


ggt 
Gly 


ttc 
Phe 
305 


ctt gat caa 
Leu Asp Gin 


ctt 
Leu 


get 
Ala 
310 


cct 
Pro 


ate 
He 


aac 
Asn 


ate 
lie 


tct 
Ser 
315 


tgc 
Cys 


cca tta 
Pro Leu 


1381 



aaa agt ttt gca get tea tag gatcttgett agtaatgtta agtgagaaga 1432 
Lys Ser Phe Ala Ala Ser 
320 

gtgttttgtt ttttcgttta tgctttagta atttaagaca tacaaaagtg tgtgttccgg 1492 

attgtagtaa gatcttaaga cataaagecg ggttttgcaa ttaggaatcg agttttaatg 1552 

aagttttagt ttatgtttg 1571 

<210> 10 

<211> 324 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 10 

Met Val Ala He Arg Lys Glu Gin Ser Leu Ser Gly Val Ser Ser Glu 
1 5 10 15 

He Lys Lys Arg Ala Lys Arg Asn Thr Leu Ser Ser Leu Pro Gin Glu 
20 " 25 30 

Thr Gin Pro Leu Arg Lys Val Arg He He Val Asn Asp Pro Tyr Ala 
35 40 45 

Thr Asp Asp Ser Ser Ser Asp Glu Glu Glu Leu Lys Val Pro Lys Pro 
50 55 60 

Arg Lys Met Lys Arg He Val Arg Glu He Asn Phe Pro Ser Met Glu 
65 70 75 80 

Val Ser Glu Gin Pro Ser Glu Ser Ser Ser Gin Asp Ser Thr Lys Thr 
85 90 95 

Asp Gly Lys He Ala Val Ser Ala Ser Pro Ala Val Pro Arg Lys Lys 
100 105 110 
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Pro Val Gly Val Arg Gin Arg Lys Trp Gly Lys Trp Ala Ala Glu He 



115 



120 125 



Ara Asp Pro lie Lys Lys Thr Arg Thr Trp Leu Gly Thr Phe Asp Thr 
130 140 

Leu Glu Glu Ala Ala Lys Ala Tyr Asp Ala Lys Lys Leu Glu Phe Asp 
145 150 155 160 

Ala He Val Ala Gly Asri Val Ser Thr Thr Lys Arg Asp Val Ser Ser 
165 170 l" 5 

Ser Glu Thr Ser Gin Cys Ser Arg Ser Ser Pro Val Val Pro Val Glu 
180 185 190 

Gin Asp Asp Thr Ser Ala Ser Ala Leu Thr Cys Val Asn Asn Pro Asp 
195 200 205 

Asp val Ser Thr Val Ala Pro Thr Ala Pro Thr Pro Asn Val Pro Ala 
210 215 220 

Gly Gly Asn Lys Glu Thr Leu Phe Asp Phe Asp Phe Thr Asn Leu Gin 
225 230 235 240 

He Pro Asp Phe Gly Phe Leu Ala Glu Glu Gin Gin Asp Leu Asp Phe 
245 250 255 

Asp Cys Phe Leu Ala Asp Asp Gin Phe Asp Asp Phe Gly Leu Leu Asp 
260 265 270 

Asp He Gin Gly Phe Glu Asp Asn Gly Pro Ser Ala Leu Pro Asp Phe 
£ 275 280 285 

Asp Phe Ala Asp Val Glu Asp Leu Gin Leu Ala Asp Ser Ser Phe Gly 
290 295 300 

Phe Leu Asp Gin Leu Ala Pro He Asn He Ser Cys Pro Leu Lys Ser 
305 310 

Phe Ala Ala Ser 



<210> 


11 


<211> 


1166 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(88) . . (951) 


<223> 


G1269 


<400> 


11 



aacaattctc tctctcttta ttcttcttct tcagcttcag atttcagatc ttaaatcttc 60 

aagtcttctt cttcttcttc tgcaacc atg get atg cag gaa cgt tgt gag agt 114 
y Met Ala Met Gin Glu Arg Cys Glu Ser 

1 5 
Page 15 



WO 01/36444 



PCT/US00/31325 



MBI0018 Sequence Listing. ST25 

tta tgt tct gat gaa ctt ata tct tec tea gat gee ttt tac etc aag 

Leu Cys Ser Asp Glu Leu lie Ser Ser Ser Asp Ala Phe Tyr Leu Lys 
10 15 20 25 

aca aga aag cct tat ace ate act aaa caa aga gag aaa tgg aca gaa 

Thr Arg Lys Pro Tyr Thr He Thr Lys Gin Arg Glu Lys Trp Thr Glu 

30 35 40 

gca gag cat gag aag ttt gta gaa gca ttg aaa etc tat ggc aga get 

Ala Glu His Glu Lys Phe Val Glu Ala Leu Lys Leu Tyr Gly Arg Ala 
45 50 55 

tgg aga cga ate gaa gaa cat gtt gga aca aaa act gca gtt cag att 
Trp Arg Arg He Glu Glu His Val Gly Thr Lys Thr Ala Val Gin He 
60 65 70 



gtt age tct gag tec att gag ate ccg cct cca agg cca aag aga aag 
val Ser Ser Glu Ser He Glu He Pro Pro Pro Arg Pro Lys Arg Lys 
90 95 100 105 



tct cac aca gag gaa tea ttg tct eta gaa gca gag ace aaa cag age 
Ser His Thr Glu Glu Ser Leu Ser Leu Glu Ala Glu Thr Lys Gin Ser 
170 175 180 185 



gaa gtg gta gtg teg gag ttc aaa aga agt gag aga tea get ttc tct 
Glu Val Val Val Ser Glu Phe Lys Arg Ser Glu Arg Ser Ala Phe Ser 
235 240 245 



162 



210 



258 



306 



cga age cat gcg cag aag ttc ttt act aag gtt get cgc gat ttt ggt 354 
Arg Ser His Ala Gin Lys Phe Phe Thr Lys Val Ala Arg Asp Phe Gly 
75 80 85 



402 



ccg atg cat cct tac cct aga aag ctt gtg att cct gat gca aaa gag 450 

Pro Met His Pro Tyr Pro Arg Lys Leu Val lie Pro Asp Ala Lys Glu 

110 115 120 

atg gta tac get gaa eta ace gga tec aag ctg att cag gat gaa gat 498 

Met Val Tyr Ala Glu Leu Thr Gly Ser Lys Leu He Gin Asp Glu Asp 

125 130 135 

aac cga tct cca aca teg gtt tta tea get cat ggc tea gat gga tta 546 

Asn Arg Ser Pro Thr Ser Val Leu Ser Ala His Gly Ser Asp Gly Leu 

140 145 150 

ggt tec att ggt tea aat tea cct aac tct tct tea get gag tta tea 594 

Gly Ser He Gly Ser Asn Ser Pro Asn Ser Ser Ser Ala Glu Leu Ser 

155 160 165 



642 



ctt aag etc ttt gga aaa act ttt gta gtt ggt gat tac aac tct tea 690 

Leu Lys Leu Phe Gly Lys Thr Phe Val Val Gly Asp Tyr Asn Ser Ser 
190 195 200 

atg agt tgt gat gat tct gaa gat ggc aag aag aag eta tac tea gaa 738 

Met Ser Cys Asp Asp Ser Glu Asp Gly Lys Lys Lys Leu Tyr Ser Glu 
205 210 215 

aca cag tct ctt caa tgt tct tct tct act tea gaa aac get gaa aca 786 

Thr Gin Ser Leu Gin Cys Ser Ser Ser Thr Ser Glu Asn Ala Glu Thr 
220 225 230 



834 



cag tta aaa teg teg gtg act gag atg aac aac atg aga ggg ttc atg 882 
Gin Leu Lys Ser Ser Val Thr Glu Met Asn Asn Met Arg Gly Phe Met. 
250 255 260 265 

cct tac aaa aag aga gta aag gtg gaa gaa aac att gac aat gta aaa 930 
Pro Tyr Lys Lys Arg Val Lys Val Glu Glu Asn He Asp Asn Val Lys 
270 275 280 

tta tea tat cct ttg tgg tga agtgttcgtt tgtgtcaagt cagttgtgta 981 
Leu Ser Tyr Pro Leu Trp 
285 

aactcttttg atctcaacat cagattatgt gtataatgtc agagtattag ggaaagtttt 1041 
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tttggattag attcgtaaga tcactccaaa gttt^tgtc tttccatata accagttaga 1101 
aattgagatc cttgtactta aacattttta tttgatcaat caaatcttct tgatgaaaaa 1161 

1166 

aaaaa 

<210> 12 

<211> 287 

<212> PRT . 

<213> Arabidopsis thaliana 

<400> 12 

Met Ala Met Gin Glu Arg Cys Glu Ser Leu Cys Ser Asp Glu Leu lie 
1 5 10 

Ser Ser Ser Asp Ala Phe Tyr Leu Lys Thr Arg Lys Pro Tyr Thr lie 
20 25 



Thr Lys Gin Arg Glu Lys Trp Thr Glu Ala Glu His Glu Lys Phe Val 
35 40 

Glu Ala Leu Lys Leu Tyr Gly Arg Ala Trp Arg Arg He Glu Glu His 
50 55 

val Gly Thr Lys Thr Ala Val Gin lie Arg Ser His Ala Gin Lys Phe 
65 70 

Phe Thr Lys Val Ala Arg Asp Phe Gly Val Ser Ser Glu Ser lie Glu 



85 90 



100 



He Pro Pro Pro Arg Pro Lys Arg Lys Pro Met His Pro Tyr Pro Arg 



105 



Lys Leu val lie Pro Asp Ala Lys Glu Met Val Tyr Ala Glu Leu Thr 
115 120 

Gly ser Lys Leu lie Gin Asp Glu Asp Asn Arg Ser Pro Thr Ser Val 
130 135 

Leu Ser Ala His Gly Ser Asp Gly Leu Gly Ser lie Gly Ser Asn Ser 
145 150 

Pro Asn Ser Ser Ser Ala Glu Leu Ser Ser His Thr Glu Glu Ser Leu 
165 

Ser Leu Glu Ala Glu Thr Lys Gin Ser Leu Lys Leu Phe Gly Lys Thr 
180 185 

Phe val val Gly Asp Tyr Asn Ser Ser Met Ser Cys Asp Asp Ser Glu 
195 200 

Asp Gly Lys Lys Lys Leu Tyr Ser Glu Thr Gin Ser Leu Gin Cys Ser 



210 215 



^ *. « rin Thr Glu Val Val Val Ser Glu Phe 
Ser Ser Thr Ser Glu Asn Ala Glu Thr fciu vax ^ 



225 



230 



Lys Arg Ser Glu Arg Ser Ala Phe Ser Gin Leu Lys Ser Ser Val Thr 
y Page 17 



WO 01/36444 



PCT/US00/31325 



MBI0018 Sequence Listing. ST25 
245 250 255 

Glu Met Asn Asn Met Arg Gly Phe Met Pro Tyr Lys Lys Arg Val Lys 
260 265 270 

Val Glu Glu Asn lie Asp Asn Val Lys Leu Ser Tyr Pro Leu Trp 
275 280 285 

<210> 13 

<211> 2031 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (240) (1574) 

<223> G1038 

<400> 13 

gctcgttttc aaattaaaaa cagggagaaa tttggaaatt ccagtacgac gggagataaa 60 

acctaacata cgccatggtg accgttatct aaactacgcc aaaatatttg aagtgtcgtc 120 

gtttcataat aaaacgcaaa caaaaaccca ctcccacttt ctcctttcca aaaaaagaac 180 

tctcgccact ttctctgctc ttttctttct ctctctcttt cttgttttcg ccggcgatc 239 

atg gag aaa age ggc ttc tct ccc gtc ggt eta agg gtt ctt gtc gta 287 
Met Glu Lys Ser Gly Phe Ser Pro Val Gly Leu Arg Val Leu Val Val 
1 5 10 15 

gac gat gat cca act tgg etc aag att etc gag aaa atg etc aag aag 335 
Asp Asp Asp Pro Thr Trp Leu Lys lie Leu Glu Lys Met Leu Lys Lys 
20 25 30 

tgt tct tac gaa gta acg acc tgt gga tta get aga gag get ttg agg 383 . 

Cys Ser Tyr Glu Val Thr Thr Cys Gly Leu Ala Arg Glu Ala Leu Arg 
35 40 45 

ttg ctg agg gag cgt aaa gat gga tat gat ate gtg ate age gat gtg 431 
Leu Leu Arg Glu Arg Lys Asp Gly Tyr Asp lie Val He Ser Asp Val 
50 55 60 

aac atg cct gac atg gat ggt ttc aag ctt ctt gag cat gtt ggt ctt 479 
Asn Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu 
65 70 75 80 

gaa tta gac etc cct gta ata atg atg teg gtg gac ggc gaa aca age 527 
Glu Leu Asp Leu Pro Val He Met Met Ser Val Asp Gly Glu Thr Ser 
85 90 95 

cga gtg atg aag gga gtg cac acg gga get tgt gat tac etc ttg aag 575 
Arg Val Met Lys Gly Val His Thr Gly Ala Cys Asp Tyr Leu Leu Lys 
100 105 110 

ccg ata aga atg aag gag tta aag att ata tgg caa cat gtt ctg aga 623 
Pro He Arg Met Lys Glu Leu Lys He He Trp Gin HiB Val Leu Arg 
115 120 125 

aag aag ctt caa gaa gtg aga gat ate gaa ggc tgt gga tac gaa gga 671 
Lys Lys Leu Gin Glu Val Arg Asp He Glu Gly Cys Gly Tyr Glu Gly 
130 135 140 

gga gcg gat tgg ate act cga tac gat gaa gca cat ttt ctt gga ggt 719 
Gly Ala Asp Trp He Thr Arg Tyr Asp Glu Ala His Phe Leu Gly Gly 
145 ~ 150 155 160 

ggt gaa gat gtt tct ttt ggg aaa aag aga aaa gac ttt gac ttt gag 767 
Gly Glu Asp Val Ser Phe Gly Lys Lys Arg Lys Asp Phe Asp Phe Glu 
165 170 175 
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aaa aaa ctt ctt caa gat gag agt gat cca tea tct tct tct tec aag 
Lyl Lys III Leu G?n Asp Glu sir Asp Pro Ser Ser Ser Ser Ser Lys 
180 185 190 

aaa act aaa att gtt tgg tct ttt gag ctt cat cat aag ttt gtc aac 
Lys Ma Ar| Val Val T% Ser Phe Glu Leu His His Lys Phe Val Asn 
7 19 | 200 205 

ace att aac caa ate gga tgc gat cac aaa get ggt ccc aag aag ata 
Ala Val Isn Gin lie Gly C*s Asp His Lys Ala Gly Pro Lye Lys lie 
210 215 220 

ttg gat etc atg aat gtt cca tgg etc act aga gaa aat gtt gca age 
ill Asp Leu Met Asn Val Pro Trp Leu Thr Arg Glu Asn Val Ala Ser 
225 230 235 240 

cac ctt caq aaa tat aga ctt tac ctg age aga tta gag aaa gga aag 
His Leu Sin Lys Tyr Arg Leu Tyr Leu Ser Arg Leu Glu Lys Gly Lys 
245 250 

aaa etc aaa tat tat tea ggt ggc gtg aag aat gcg gat tea tct cca 
Glu Leu lyl C?s Tyr Ser Gly Gly Val Lys Asn Ala Asp Ser Ser Pro 
260 265 270 

aaa aat ate aaa gtg aat tea ggc tac caa age cct ggg agg age age 
lys Asp vll G?u Val Asn Ser Gly Tyr Gin Ser Pro Gly Arg Ser Ser 
275 280 285 

tat qta ttc tct gga gga aat tct ctg ate caa aaa gca aca gag att 
T^r Val Phe Ser Gly Gly Asn Ser Leu lie Gin Lys Ala Thr Glu lie 
290 295 300 

gat cca aag cca ctt get tea get tct ttg tct gac ccc aac ace gat 
Asp Pro Lys Pro Leu Ala Ser Ala Ser Leu Ser Asp Pro Asn Thr Asp 
305 310 315 

qtq ate atg cct ccg aaa aca aaa aag acg cgt ata gga ttt gat cct 
Val lie Me? Pro Pro Lys Thr Lys Lys Thr Arg lie Gly Phe Asp Pro 
325 330 335 

ccc att tcc'tcc tct gcg ttt gac tct ctg ctt cct tgg aat gat gtt 
Pro lie Ser Ser Ser Ala Phe Asp Ser Leu Leu Pro Trp Asn Asp Val 
340 345 350 

cca qaq ate ctt gaa teg aag ccg gtt ctg tat gag aat age ttt etc 
Pro G?u wit Leu Glu Se? Ly! Pro Val Leu Tyr Glu Asn Ser Phe Leu 
355 360 365 

cag caa caa cca ttg cca agt caa agt tee tat gtt gca att tct gca 
Gin Gin Gin Pro Leu Pro Ser Gin Ser Ser Tyr Val Ala He Ser Ala 
370 375 380 

cca tct etc atg gag gag gaa atg aag cct cct tat gag aca cca gca 
Pro Ser LeC Me? Glu 111 Glu Me? Lys Pro Pro Tyr Glu Thr Pro Ala 
385 390 395 4 ™ 

o«a aar aat aat qtq aat gca gat gag ttt etc atg cca caa gac aag 
flylfy l?r llr Val Asn Ala Asp Gl? Phe Leu Met Pro Gin Asp Lys 
405 410 41b 

ate cct act gta ace ctt caa gat ttg gat ccc tct gee atg aag gj:g 
lie Pro Thr Val Thr Leu Gin Asp Leu Asp Pro Ser Ala Met Lys Leu - 
420 425 430 

cag gag ttc aac aca gaa ggc gat tct gaa gaa get tga actggggaac 
Gin Glu Phe Asn Thr Glu Gly Asp Ser Glu Glu Ala 

435 440 
ttccagaatc acatcattct gtttctttag acactgactt agacttgact tggcttcaag 
gcgagcgttt ettgeaaaca ccgactccag tttcaagata cagtagtagc ccatcactcc 
tatctgagct cccagcccac cttaattggt atggaaatga gcggctgcct gaccctgacg 
agtattcctt catggtagac caaggtttat tcatatctta accttgttcc aataacttct 
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815 

863 

911 

959 

1007 

1055 

1103 

1151 

1199 

1247 

1295 

1343 

1391 

1439 

1487 

1535 

1584 

1644 
1704 
1764 
1824 
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tttcgtatat tggttggtgt aatgcagaaa gattttgtgg gtatacctga aaataatctt 1884 

gctttcccaa gaaccttcca tgatcggatg cattgtacaa taatccacga gtgtcgtagg 1944 

ctaattacac caaacaggtt gatgacagtg ataaggccac atgtttcaca ccgtcgctta 2004 

agatctttac tgtcacctgg aaggaaa 2031 

<210> 14 
<211> 444 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

Met Glu Lys Ser Gly Phe Ser Pro Val Gly Leu Arg Val Leu Val Val 
15 10 15 

Asp Asp Asp Pro Thr Trp Leu Lys He Leu Glu Lys Met Leu Lys Lys 
20 25 30 

Cys Ser Tyr Glu Val Thr Thr Cys Gly Leu Ala Arg Glu Ala Leu Arg 
35 40 45 

Leu Leu Arg Glu Arg Lys Asp Gly Tyr Asp He Val He Ser Asp Val 
50 55 60 

Asn Met Pro Asp Met Asp Gly Phe Lys Leu Leu Glu His Val Gly Leu 
65 70 75 80 

Glu Leu Asp Leu Pro Val He Met Met Ser Val Asp Gly Glu Thr Ser 
85 90 95 

Arg Val Met Lys Gly Val His Thr Gly Ala Cys Asp Tyr Leu Leu Lys 
100 105 110 

Pro He Arg Met Lys Glu Leu Lys He He Trp Gin His Val Leu Arg 
115 * 120 125 

Lys Lys Leu Gin Glu Val Arg Asp He Glu Gly Cys Gly Tyr Glu Gly 
130 135 140 

Gly Ala Asp Trp He Thr Arg Tyr Asp Glu Ala His Phe Leu Gly Gly 
145 150 155 160 

Gly Glu Asp Val Ser Phe Gly Lys Lys Arg Lys Asp Phe Asp Phe Glu 
165 170 175 

Lys Lys Leu Leu Gin Asp Glu Ser Asp Pro Ser Ser Ser Ser Ser Lys 
180 185 190 

Lys Ala Arg Val Val Trp Ser Phe Glu Leu His His Lys Phe Val Asn 
195 200 205 

Ala Val Asn Gin He Gly Cys Asp His Lys Ala Gly Pro Lys Lys He 
210 215 220 

Leu Asp Leu Met Asn Val Pro Trp Leu Thr Arg Glu Asn Val Ala Ser 
225 230 235 240 
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His Leu Gin Lys Tyr Arg Leu Tyr Leu Ser Arg Leu Glu Lys Gly Lys 
245 250 255 

Glu Leu Lys Cys Tyr Ser Gly Gly Val Lys Asn Ala Asp Ser Ser Pro 
260 265 

Lys Asp Val Glu Val Asn Ser Gly Tyr Gin Ser Pro Gly Arg Ser Ser 
275 280 

Tvr val Phe Ser Gly Gly Asn Ser Leu He Gin Lys Ala Thr Glu He 
290 295 300 

Asp Pro Lys Pro Leu Ala Ser Ala Ser Leu Ser Asp Pro Asn Thr Asp 
305 310 315 

Val He Met Pro Pro Lys Thr Lys Lys Thr Arg He Gly Phe Asp Pro 
325 330 33b 

Pro He Ser Ser Ser Ala Phe Asp Ser Leu Leu Pro Trp Asn Asp Val 
340 345 350 

Pro Glu Val Leu Glu Ser Lys Pro Val Leu Tyr Glu Asn Ser Phe Leu 
355 360 365 

Gin Gin Gin Pro Leu Pro Ser Gin Ser Ser Tyr Val Ala He Ser Ala 
370 375 380 

Pro Ser Leu Met Glu Glu Glu Met Lys Pro Pro Tyr Glu Thr Pro Ala 
385 390 395 400 

Gly Gly Ser Ser val Asn Ala Asp Glu Phe Leu Met Pro Gin Asp Lys 
405 410 

lie Pro Thr Val Thr Leu Gin Asp Leu Asp Pro Ser Ala Met Lys Leu 
420 425 3 

Gin Glu Phe Asn Thr Glu Gly Asp Ser Glu Glu Ala 
435 44 <> 

<210> 15 
<211> 2821 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (188) . . (2716) 

<223> G438 

cggggtaccc aagccacgac cgtagaatct tcttttgtct gaaaagaatt acaatttacg 60 

tttctcttac gatacgacgg actttccgaa gaaattaatt taaagagaaa agaagaagaa 120 

gccaaagaag aagaagaagc tagaagaaac agtaaagttt gagacttttt ttgagggtcg 180 

aactaaa atq qag atg gcg gtg get aac cac cgt gag aga age agt gac 229 
agctaaa atg gag acg y^ ^ Hi8 G lu Arg Ser Ser Asp 

1 5 10 
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agt atg aat aga cat tta gat agt age ggt aag tac gtt agg tac aca 277 

Ser Met Asn Arg His Leu Asp Ser Ser Gly Lys Tyr Val Arg Tyr Thr 
15 20 25 30 

get gag caa gtc gag get ctt gag cgt gtc tac get gag tgt cct aag 325 

Ala Glu Gin Val Glu Ala Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys 
35 40 45 

cct age tct etc cgt cga caa caa ttg ate cgt gaa tgt tec att ttg 373 

Pro Ser Ser Leu Arg Arg Gin Gin Leu lie Arg Glu Cys Ser He Leu 
50 55 60 

gec aat att gag cct aag cag ate aaa gtc tgg ttt cag aac cgc agg 421 

Ala Asn He Glu Pro Lys Gin He Lys Val Trp Phe Gin Asn Arg Arg 
65 70 75 

tgt cga gat aag cag agg aaa gag gcg teg agg etc cag age gta aac 469 

Cys Arg Asp Lys Gin Arg Lys Glu Ala Ser Arg Leu Gin Ser Val Asn 

80 85 90 

egg aag etc tct gcg atg aat aaa ctg ttg atg gag gag aat gat agg 517 

Arg Lys Leu Ser Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg 
95 100 105 110 

ttg cag aag cag gtt tct cag ctt gtc tgc gaa aat gga tat atg aaa 565 

Leu Gin Lys Gin Val Ser Gin Leu Val Cys Glu Asn Gly Tyr Met Lys 
115 120 125 

cag cag eta act act gtt gtt aac gat cca age tgt gaa tct gtg gtc 613 

Gin Gin Leu Thr Thr Val Val Asn Asp Pro Ser Cys Glu Ser Val Val 
130 135 140 

aca act cct cag cat teg ctt aga gat gcg aat agt cct get gga ttg 661 

Thr Thr Pro Gin His Ser Leu Arg Asp Ala Asn Ser Pro Ala Gly Leu 
145 150 155 

etc tea ate gca gag gag act ttg gca gag ttc eta tec aag get aca 709 

Leu Ser He Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr 

160 165 170 

gga act get gtt gat tgg gtt cag atg cct ggg atg aag cct ggt ccg 757 

Gly Thr Ala Val Asp Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro 
175 180 185 190 



gat teg gtt ggc ate ttt gee att teg caa aga tgc aat gga gtg gca 
Asp Ser Val Gly He Phe Ala He Ser Gin Arg Cys Asn Gly Val Ala 
195 200 205 



805 



get cga gee tgt ggt ctt gtt age tta gaa cct atg aag att gca gag 853 
Ala Arg Ala Cys Gly Leu Val Ser Leu Glu Pro Met Lys He Ala Glu 
210 215 220 

ate etc aaa gat egg cca tct tgg ttc cgt gac tgt agg age ctt gaa 901 
He Leu Lys Asp Arg Pro Ser Trp Phe Arg Asp Cys Arg Ser Leu Glu 
225 230 235 

gtt ttc act atg ttc ccg get ggt aat ggt ggc aca ate gag ctt gtt 949 
Val Phe Thr Met Phe Pro Ala Gly Asn Gly Gly Thr He Glu Leu Val 
240 245 250 

tat atg cag acg tat gca cca acg act ctg get cct gec cgc gat ttc 997 
Tyr Met Gin Thr Tyr Ala Pro Thr Thr Leu Ala Pro Ala Arg Asp Phe 
255 260 265 270 

tgg ace ctg aga tac aca acg age etc gac aat ggg agt ttt gtg gtt 1045 
Trp Thr Leu Arg Tyr Thr Thr Ser Leu Asp Asn Gly Ser Phe Val Val 
275 280 285 

tgt gag agg teg eta tct ggc tct gga get ggg cct aat get get tea 1093 
Cys Glu Arg Ser Leu Ser Gly Ser Gly Ala Gly Pro Asn Ala Ala Ser 
290 295 300 

get tct cag ttt gtg aga gca gaa atg ctt tct agt ggg tat tta ata 1141 
Ala Ser Gin Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu He 
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305 310 315 

agg cct tat gat ggt ggt ggt tct att att cac att gtc gat cac ctt 
III Pro Cys Asp Gly Gly Gly Ser lie lie His lie val Asp His Leu 

325 330 



320 



ntt aaa ac t tgq aqt gtt ccg gat gtg ctt cga ccc ctt tat gag 
Asn Leu ll2 lit Trp sir Val Prl Asp U Leu Arg Pro Leu Tyr Glu 



335 340 



tea tec aaa gtc gtt gca caa aaa atg acc att tec gcg ttg egg tat 
Ser Ser Lys Val Val Ala Gin Lye Met Thr He Ser Ala Leu Arg Tyr 



355 



400 



atn cat tqt gat gga gcg gaa gat att ate gtt get att aac tct aca 
Met H?s Cys Asp Gly All Glu Asp lie He Val Ala He Asn Ser Thr 
420 425 * J 



415 



aag cat ttg aat aat att tct aat tct ctt teg ttc ctt gga ggc gtg 
Lyl His Leu Asn Asn lie Ser Asn Ser Leu Ser Phe Leu Gly Gly Val 
* 435 440 445 

etc tqt qcc aag get tea atg ctt etc caa aat gtt cct cct gcg gtt 
Itu gs Ma Lys Ala Ser Me? Leu Leu Gin Asn Val Pro Pro Ala Val 

ttg ate egg ttc ctt aga gag cat cga tct gag tgg get gat ttc aat 
Leu lie Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Phe Asn 
465 470 475 

qtt qat qca tat tec get get aca ctt aaa get ggt age ttt get tat 
val Isp Ala Tyr Ser Ala Ala Thr Leu Lys Ala Gly Ser Phe Ala Tyr 
480 4 85 49° 

ccg gga atg aga cca aca aga ttc act ggg agt cag ate ata atg cca 
Pro Gly Met Arg Pro Thr Arg Phe Thr Gly Ser Gin He He Met Pro 
495 500 505 510 

eta qqa cat aca att gaa cac gaa gaa atg eta gaa gtt gtt aga ctg 
Leu Gly Tie ?hr lie Glu His Glu Glu Met Leu Glu Val Val Arg Leu 
520 52b 



515 



gaa ggt cat tct ctt get caa gaa gat gca ttt atg tea egg gat gtc 
Glu Gly His Ser Leu Ala Gin Glu Asp Ala Phe Met Ser Arg Asp Val 
530 535 540 

cat etc ctt cag att tgt acc ggg att gac gag aat gee gtt gga get 
His Leu Leu Gin He Cys Thr Gly lie Asp Glu Asn Ala Val Gly Ala 

550 555 



545 



1189 



1237 



1285 



1333 



1381 



ate agg caa tea gec caa gag tct aat ggt gaa gta gtg tat gga tta 
lie tot Gin Leu Ala Gin Glu Ser Asn Gly Glu Val Val Tyr Gly Leu 
370 375 38U 

gga agg cag cct get gtt ctt aga acc ttt age caa aga tta age agg 
Gly ArS Gin Pro Ala Val Leu Arg Thr Phe Ser Gin Arg Leu Ser Arg 
385 390 395 

aae ttc aat gat qcq gtt aat ggg ttt ggt gac gac ggg tgg tct acg 1429 
lly Phe Asn Asp III Val Asn Gly Phe Gly Asp Asp Gly Trp Ser Thr 

405 410 



1477 



1525 



1573 



1621 



1669 



1717 



1765 



1813 



1861 



tgt tct gaa ctg ata -ttt get ccg att aat gag atg ttc ccg gat gat 1909 
C*l ser Glu Leu He Phe Ala Pro He Asn Glu Met Phe Pro Asp Asp 
560 565 570 

get cca ctt gtt ccc tct gga ttc cga gtc ata ccc gtt gat get aaa 1957 
Ala Pro Leu Val Pro Ser Gly Phe Arg Val lie Pro Val Asp Ala Lys 
575 580 585 5*u 

acq gga gat gta caa gat ctg tta acc get aat cac cgt aca eta gac 
Thr Gly Asp Val Gin Asp Lei Leu Thr Ala Asn His Arg Thr Leu Asp 
595 600 

tta act tct age ctt gaa gtc ggt cca tea cct gag aat get tct gga 
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Leu Thr Ser Ser Leu Glu Val Gly Pro Ser Pro Glu Asn Ala Ser Gly 
610 615 620 

aac tct ttt tct age tea age teg aga tgt att etc act ate gcg ttt 2101 
Asn Ser Phe Ser Ser Ser Ser Ser Arg Cys He Leu Thr He Ala Phe 
625 630 635 

caa ttc cct ttt gaa aac aac ttg caa gaa aat gtt get ggt atg get 214 9 
Gin Phe Pro Phe Glu Asn Asn Leu Gin Glu Asn Val Ala Gly Met Ala 
640 645 650 

tgt cag tat gtg agg age gtg ate tea tea gtt caa cgt gtt gca atg 2197 
Cys Gin Tyr Val Arg Ser Val He Ser Ser Val Gin Arg Val Ala Met 
655 660 665 670 

gcg ate tea ccg tct ggg ata age ccg agt ctg ggc tec aaa ttg tec 224 5 
Ala He Ser Pro Ser Gly He Ser Pro Ser Leu Gly Ser Lys Leu Ser 
675 680 685 

cca gga tct cct gaa get gtt act ctt get cag tgg ate tct caa agt 2293 
Pro Gly Ser Pro Glu Ala Val Thr Leu Ala Gin Trp He Ser Gin Ser 
690 695 700 

tac agt cat cac tta ggc teg gag ttg ctg acg att gat tea ctt gga 2341 
Tyr Ser His His Leu Gly Ser Glu Leu Leu Thr He Asp Ser Leu Gly 
705 710 715 

age gac gac teg gta eta aaa ctt eta tgg gat cac caa gat gee ate 2389 
Ser Asp Asp Ser Val Leu Lys Leu Leu Trp Asp His Gin Asp Ala He 
720 725 730 

ctg tgt tgc tea tta aag cca cag cca gtg ttc atg ttt gcg aac caa 2437 
Leu Cys Cys Ser Leu Lys Pro Gin Pro Val Phe Met Phe Ala Asn Gin 
735 740 745 750 

get ggt eta gac atg eta gag aca aca ctt gta gee tta caa gat ata 24 85 

Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He 
755 760 765 

aca etc gaa aag ata ttc gat gaa teg ggt cgt aag get ate tgt teg 2533 
Thr Leu Glu Lys He Phe Asp Glu Ser Gly Arg Lys Ala He Cys Ser 
770 775 780 

gac ttc gee aag eta atg caa cag gga ttt get tgc ttg cct tea gga 2581 
Asp Phe Ala Lys Leu Met Gin Gin Gly Phe Ala Cys Leu Pro Ser Gly 
785 790 795 

ate tgt gtg tea acg atg gga aga cat gtg agt tat gaa caa get gtt 2629 
He Cys Val Ser Thr Met Gly Arg His Val Ser Tyr Glu Gin Ala Val 
800 805 810 

get tgg aaa gtg ttt get gca tct gaa gaa aac aac aac aat ctg cat 2677 
Ala Trp Lys Val Phe Ala Ala Ser Glu Glu Asn Asn Asn Asn Leu His 
815 820 825 830 

tgt ctt gee ttc tec ttt gta aac tgg tct ttt gtg tga ttcgattgac 2726 
Cys Leu Ala Phe Ser Phe Val Asn Trp Ser Phe Val 
835 840 

agaaaaagac taatttaaat ttacgttaga gaactcaaat ttttggttgt tgtttaggtg 2786 

tctctgtttt gttttttaaa attattttga tcaaa 2821 

<210> 16 

<211> 842 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 16 

Met Glu Met Ala Val Ala Asn His Arg Glu Arg Ser Ser Asp Ser Met 
1 5 10 15 
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Asn ^ His Leu Asp Ser Ser Gly Lys Tyr VaX Arg Tyr Thr AXa GXu 
20 2 * 



, ^ Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser 

Gin Val Glu Ala Leu fciu «rg v« / 45 

35 * u 

Ser Leu Ar 3 Arg Cln Gin Leu He Arg Glu Cys Ser XXe Leu AXa Asn 
50 5b 

Xle Glu Pro Lys GXn He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 
65 70 

A3P Lys Gin Arg Lys Glu Ala Ser Arg Leu Gin Ser Val Asn Arg Lys 
85 

Leu ser Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 xo * 

Lya Sl n val ser Gin Leu Val Cys Glu Asn Gly Tyr Met Lys Gin Gin 
115 120 

Leu Thr Thr Val Val Asn Asp Pro Ser Cys Glu Ser Val val Thr Thr 
130 135 

Pro Gin His Ser Leu Arg Asp Ala Asn Ser Pro Ala Gly Leu Leu Ser 
145 150 

Ile Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala T h r Gly Thr 

Ma val Asp Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 

«i nrra Ann Glv Val Ala Ala Arg 

Val Gly He Phe Ala He Ser GXn Arg Cys Asn Gly ^ 

195 200 

Ma cys GXy Leu vaX Ser Leu GXu Pro Met Lys XXe Ma GXu XXe Leu 
210 21b 

Ly s Asp Arg Pro Ser Trp Phe Arg Asp Cys Arg Ser Leu GXu VaX Phe 
225 230 

Thr Met Phe Pro AXa Gly Asn Gly Gly Thr Xle Glu Leu Val Tyr Met 
245 

G1 n Thr Tyr Ala Pro Thr Thr Leu Ala Pro Ala Arg Asp Phe Trp Thr 
260 

L eu Arg Tyr Thr Thr Ser Leu Asp Asn Gly Ser Phe Val Val Cys Glu 
275 2au 

^ cor- riv Ala Gly Pro Asn Ala Ala Ser Ala Ser 
Arg Ser Leu Ser Gly Ser Gly Ala ciy r ^ 

290 

G1 „ Phe val Arg Ala GXu Met Leu Ser Ser GXy Tyr Leu He Arg Pro 
305 310 
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Cys Asp Gly Gly Gly Ser lie lie His lie Val Asp His Leu Asn Leu 
325 330 335 

Glu Ala Trp Ser Val Pro Asp Val Leu Arg Pro Leu Tyr Glu Ser Ser 
340 345 350 

Lys Val Val Ala Gin Lys Met Thr lie Ser Ala Leu Arg Tyr lie Arg 
355 360 365 

Gin Leu Ala Gin Glu Ser Asn Gly Glu Val Val Tyr Gly Leu Gly Arg 
370 375 380 

Gin Pro Ala Val Leu Arg Thr Phe Ser Gin Arg Leu Ser Arg Gly Phe 
385 390 395 400 

Asn Asp Ala Val Asn Gly Phe Gly Asp Asp Gly Trp Ser Thr Met His 
405 410 415 

Cys Asp Gly Ala Glu Asp He He Val Ala lie Asn Ser Thr Lys His 
420 425 430 

Leu Asn Asn He Ser Asn Ser Leu Ser Phe Leu Gly Gly Val Leu Cys 
435 440 445 

Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val Leu He 
450 455 460 

Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Phe Asn Val Asp 
465 470 475 480 

Ala Tyr Ser Ala Ala Thr Leu Lys Ala Gly Ser Phe Ala Tyr Pro Gly 
485 490 495 

Met Arg Pro Thr Arg Phe Thr Gly Ser Gin He He Met Pro Leu Gly 
300 505 510 

His Thr He Glu His Glu Glu Met Leu Glu Val Val Arg Leu Glu Gly 
515 520 525 

His Ser Leu Ala Gin Glu Asp Ala Phe Met Ser Arg Asp Val His Leu 
530 535 540 

Leu Gin He Cys . Thr Gly He Asp Glu Asn Ala Val Gly Ala Cys Ser 
545 550 555 560 

Glu Leu He Phe Ala Pro He Asn Glu Met Phe Pro Asp Asp Ala Pro 
565 570 575 

Leu Val Pro Ser Gly Phe Arg Val He Pro Val Asp Ala Lys Thr Gly 
580 585 590 

Asp Val Gin Asp Leu Leu Thr Ala Asn His Arg Thr Leu Asp Leu Thr 
595 600 605 

Ser Ser Leu Glu Val Gly Pro Ser Pro Glu Asn Ala Ser Gly Asn Ser 
610 615 620 
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Ser Ser Ser Ser Ser Arg Cys lie Leu Thr lie Ala Phe Gin Phe 



Phe 

625 630 



635 640 



Pro Phe Glu Asn Asn Leu Gin Glu Asn val Ala Gly Met Ala Cys Gin 
645 650 655 

Tyr Val Arg Ser Val lie Ser Ser Val Gin Arg Val Ala Met Ala lie 
660 665 670 

Ser Pro Ser Gly lie Ser Pro Ser Leu Gly Ser Lys Leu Ser Pro Gly 
675 680 685 

Ser Pro Glu Ala Val Thr Leu Ala Gin Trp lie Ser Gin Ser Tyr Ser 
690 695 700. 

His His Leu Gly Ser Glu Leu Leu Thr lie Asp Ser Leu Gly Ser Asp 
705 710 715 

Asp Ser Val Leu Lys Leu Leu Trp Asp His Gin Asp Ala lie Leu Cys 
725 730 735 

Cys Ser Leu Lys Pro Gin Pro Val Phe Met Phe Ala Asn Gin Ala Gly 
740 745 750 

Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp lie Thr Leu 
755 760 765 

Glu Lys lie Phe Asp Glu Ser Gly Arg Lys Ala lie Cys Ser Asp Phe 
770 775 780 

Ala Lys Leu Met Gin Gin Gly Phe Ala Cys Leu Pro Ser Gly He Cys 
785 790 795 800 

Val Ser Thr Met Gly Arg His Val Ser Tyr Glu Gin Ala Val Ala Trp 
805 810 815 

Lys Val Phe Ala Ala Ser Glu Glu Asn Asn Asn Asn Leu His Cys Leu 
820 825 830 

Ala Phe Ser Phe Val Asn Trp Ser Phe Val 
835 840 

<210> 17 
<2H> 1888 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (326) . . (1708) 
<223> G571 

tagccgacct ctcttctctc ttctgaaaaa aacaccaaag gagctttaaa tgctccgtta 60 
cataatctct atctctttcc aagaatatag agaaaggaaa ataatataca agaattaaaa 120 
gaaggtatat catcatctct ctagctagtg atcaaagcac cgtcatcatc atcatatatc 180 
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atcagcttgc ctcagaggag aagaccaaca taagagagat cgaagatcaa aatctatctc 240 

tcttcatcat cttctgctgt tactatcata tcacacgctc tctcaaacat catcctatat 300 

atagacttct cttcatcatc atcaa atg caa ggt cat cac cag aat cat cat 352 

Met Gin Gly His His Gin Asn His His 
1 5 

caa cac tta tea tea tec tec gec acg tct tec cat gga aac ttc atg 400 
Gin His Leu Ser Ser Ser Ser Ala Thr Ser Ser His Gly Asn Phe Met 
10 15 20 25 

aac aaa gat ggg tat gat att gga gag ata gac cca tea etc ttc etc 448 
Asn Lys Asp Gly Tyr Asp lie Gly Glu lie Asp Pro Ser Leu Phe Leu 
30 35 40 



tat ctt gat gga caa gga cat cat gat cct cca tea act get cct tct 
Tyr Leu Asp Gly Gin Gly His His Asp Pro Pro Ser Thr Ala Pro Ser 
45 50 55 



496 



736 



cct tta cat cat cat cac aca act cag aat ttg gcg atg aga cct cca 544 
Pro Leu His His His His Thr Thr Gin Asn Leu Ala Met Arg Pro Pro 
60 65 70 

aca teg acg etc aac ate ttt cca tct cag cct atg cac ata gag cca 592 
Thr Ser Thr Leu Asn lie Phe Pro Ser Gin Pro Met His lie Glu Pro 
75 80 85 

cct cct tct tct aca cac aat ace gat aat aca aga tta gtt ccg get 64 0 

Pro Pro Ser Ser Thr His Asn Thr Asp Asn Thr Arg Leu Val Pro Ala 
90 95 100 105 

get caa cct agt ggt tec act cga cca get tct gac ccg tec atg gac 688 
Ala Gin Pro Ser Gly Ser Thr Arg Pro Ala Ser Asp Pro Ser Met Asp 
110 115 120 

ttg ace aat cat tct cag ttt cat caa cct cct caa ggt tct aaa tec 
Leu Thr Asn His Ser Gin Phe His Gin Pro Pro Gin Gly Ser Lys Ser 
125 130 135 

ate aag aag gaa ggg aac cgc aag ggt ctt gee tea teg gac cat gac 784 
lie Lys Lys Glu Gly Asn Arg Lys Gly Leu Ala Ser Ser Asp His Asp 
140 145 150 

ata cct aaa teg tea gac cct aaa aca ttg aga aga eta gca caa aac 832 
He Pro Lys Ser Ser Asp Pro Lys Thr Leu Arg Arg Leu Ala Gin Asn 
155 160 165 

aga gaa gca gca aga aaa age aga tta cgt aaa aag get tat gtt cag 880 
Arg Glu Ala Ala Arg Lys Ser Arg Leu Arg Lys Lys Ala Tyr Val Gin 
170 175 180 185 

caa etc gag tea tgt agg ate aaa ctg ace caa eta gaa caa gag att 928 
Gin Leu Glu Ser Cys Arg He Lys Leu Thr Gin Leu Glu Gin Glu He 
190 195 200 

caa egg gee aga tec caa ggc gta ttc ttt gga ggg tct ctt ata gga 976 
Gin Arg Ala Arg Ser Gin Gly Val Phe Phe Gly Gly Ser Leu He Gly 
205 210 215 

gga gat caa cag caa ggt gga eta ccc att ggc cct ggc aac ate age 1024 
Gly Asp Gin Gin Gin Gly Gly Leu Pro lie Gly Pro Gly Asn He Ser 
220 225 230 

tct gaa gca gcg gtg ttc gat atg gaa tat gcg agg tgg ctg gag gag 1072 
Ser Glu Ala Ala Val Phe Asp Met Glu Tyr Ala Arg Trp Leu Glu Glu 
235 240 245 

cag cag agg eta tta aac gaa eta agg gtg gca aca caa gaa cac ttg 1120 
Gin Gin Arg Leu Leu Asn Glu Leu Arg Val Ala Thr Gin Glu His Leu 
250 255 260 265 

tec gag aac gag ctt agg atg ttt. gtg gac aca tgt tta get cat tat 1168 
Ser Glu Asn Glu Leu Arg Met Phe Val Asp Thr Cys Leu Ala His Tyr 
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275 280 



B «" a s; s a a e a e si a s g e s 

SS K SS S If? E S "', S S 155 K f§ 85 S 3 

300 305 310 

?s s a; a? a a ss a e a s a a s a s 

* 315 320 325 

caa ata qaa cca ttg acg gag caa cag ata gtt ggg ata tgt ggg ctg 
Gin He G?u Pro Leu Th? Glu Gin Gin lie Val Gly He Cys Gly Leu 
330 33 5 340 3 

caa cag tec aca caa gag gec gag gag get etc teg caa ggc etc gag 
Gin Gin Ser Thr Gin Glu Ala Glu Glu Ala Leu Ser Gin Gly Leu eiu 
350 3 55 

acq ttg aat caa tea ctt tec gat age att gtc tct gac tec etc ccg 
111 Leu Asn G?n Ser Leu Ser Asp Ser lie Val Ser Asp Ser Leu Pro 
365 37< > 375 

cct gee tec gca cca ctt cct cct cat eta tec aat ttc atg tea cac 
Pro Ala Ser Ala Pro Leu Pro Pro His Leu Ser Asn Phe Met Ser His 
380 385 390 

ata tec tta qct etc aac aag etc tct get etc gag ggc ttc gtt etc 
Mel Ser Leu Ma Leu Asn Lyl Leu Ser Ala Leu Glu Gly Phe Val Leu 
395 400 405 

rao QCa aat a at ttg agg cac caa acg ate cat agg ctg aac caa ttg 
SS IS Isp Asn III A?l His Gin Thr lie His Arg Leu Asn Gin Leu 
410 415 

ttg acg acc cgt caa gaa gca egg tgt ctt eta gee gtt gcg gag tac 
Leu Th? Thr Arg Gin Glu Ala Arg Cys Leu Leu Ala Val Ala Glu Tyr 
430 435 

t-t-r cac cat ctt caa get eta agt tct etc tgg eta gec cgt cct egg 
Pne Hit Arg Leu Gin Ala Leu sir Ser Leu Trp Leu Ala Arg Pro Arg 
445 450 4 " 

caa gat gga taa tactaaaaca actgatgaag gaaaccaaaa acaaaaacaa 
Gin Asp Gly 
460 

gagaataggt tgattagtta gccgccagct tgacctcttt atcatatata tcgtctctct 
actcaaatac agtgcaatta gggaaaattg tttggcttct ttttggtata tgattcttac 
tattatgttt ttaatcaaga 



1216 

1264 

1312 

1360 

1408 

1456 

1504 

1552 

1600 

1648 

1696 

1748 

1808 
1868 
1888 



<210> 18 
<211> 460 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Met Gin Gly His His Gin Asn His His Gin His Leu Ser Ser Ser Ser 
1 5 10 

Ala Thr Ser Ser His Gly Asn Phe Met Asn Lys Asp Gly Tyr Asp He 
20 25 30 

Gly Glu lie Asp Pro Ser Leu Phe Leu Tyr Leu Asp Gly Gin Gly His 
35 40 4b 
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His Asp Pro Pro Ser Thr Ala Pro Ser Pro Leu His His His His Thr 
50 55 60 

Thr Gin Asn Leu Ala Met Arg Pro Pro Thr Ser Thr Leu Asn He Phe 
65 70 75 80 

Pro Ser Gin Pro Met His He Glu Pro Pro Pro Ser Ser Thr His Asn 
85 90 95 

Thr Asp Asn Thr Arg Leu Val Pro Ala Ala Gin Pro Ser Gly Ser Thr 
100 105 HO 

Arg Pro Ala Ser Asp Pro Ser Met Asp Leu Thr Asn His Ser Gin Phe 
115 120 125 

His Gin Pro Pro Gin Gly Ser Lys Ser He Lys Lys Glu Gly Asn Arg 
130 135 140 

Lys Gly Leu Ala Ser Ser Asp His Asp lie Pro Lys Ser Ser Asp Pro 
145 150 155 160 

Lys Thr Leu Arg Arg Leu Ala Gin Asn Arg Glu Ala Ala Arg Lys Ser 
165 170 175 

Arg Leu Arg Lys Lys Ala Tyr Val Gin Gin Leu Glu Ser Cys Arg He 
180 185 190 

Lys Leu Thr Gin Leu Glu Gin Glu He Gin Arg Ala Arg Ser Gin Gly 
195 200 205 

Val Phe Phe Gly Gly Ser Leu He Gly Gly Asp Gin Gin Gin Gly Gly 
210 ** 215 220 

Leu Pro He Gly Pro Gly Asn He Ser Ser Glu Ala Ala Val Phe Asp 
225 230 235 240 

Met Glu Tyr Ala Arg Trp Leu Glu Glu Gin Gin Arg Leu Leu Asn Glu 
245 250 255 

Leu Arg Val Ala Thr Gin Glu His Leu Ser Glu Asn Glu Leu Arg Met 
260 265 270 

Phe Val Asp Thr Cys Leu Ala His Tyr Asp His Leu He Asn Leu Lys 
27S 280 285 



Ala Met Val Ala Lys Thr Asp Val 
290 295 



Lys Thr Pro Ala Glu Arg Cys Phe 
305 310 



Ser Glu lie He Lys Val He Val 
325 



Phe His Leu He Ser Gly Ala Trp 
300 



Leu Trp Met Gly Gly Phe Arg Pro 
315 320 



Asn Gin He Glu Pro Leu Thr Glu 
330 335 



Gin Gin lie Val- Gly He Cys Gly Leu Gin Gin Ser Thr Gin Glu Ala 
340 345 350 
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Glu Glu Ala Leu Ser 
355 



Gin Gly Leu Glu Ala Leu Asn Gin Ser Leu Ser 

365 



360 



Asp ser lie val Ser Asp Ser Leu Pro Pro Ala Ser Ala Pro Leu Pro 
370 375 



Pro His Leu Ser Asn Phe 3g5 



Met Ser His Met Ser Leu Ala Leu Asn Lys 



Leu Ser 



Ala Leu Glu Gly Phe Val Leu Gin Ala Asp Asn Leu Arg His 



405 



410 



Gin Thr He His Arg 



Leu Asn Gin Leu Leu Thr Thr Arg Gin Glu Ala 



Arg Cys Leu Leu 
435 



420 425 

Ala val Ala Glu Tyr Phe His Arg Leu Gin Ala Leu 

44b 



440 



Ser 



Ser Leu Trp Leu Ala Arg Pro Arg Gin Asp Gly 



450 



455 



<210> 19 

<211> 1707 

<212> DNA . 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (98) . . (1444) 

<223> G748 



ccacgcgtcc gcactctccc aaatctctct tctttaacaa caaaaaaaaa atcacagaga 60 
catagagaga agaagacgga acagaggctc caaaaaa atg atg atg gag act aga 115 



b ss & ne a s ss e a s s ss - s s «s 

10 15 

s k is s e s a e s s c b p c s k 
si? e e b s - s as s <s s 5 B s; ss e 

40 4 ^ 

5? S E S |S = S S K S S SS » S S 

e s s b is b ss s s k s k a s e a 
as c s s° e e ss ss a a s s s g K s 

90 95 

s s s s s » s a s ?s » a §j s s a 



163 



211 



259 



307 



355 



403 



451 



110 
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gag acc aag ttc tgt tat tac aac aac tac aac ata aac cag cct cgt 

' Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr Asn He Asn Gin Pro Arg 

120 125 130 

cat ttc tgc aag get tgt cag aga tat tgg act get gga ggg act atg 

His Phe Cys Lys Ala Cys Gin Arg Tyr Trp Thr Ala Gly Gly Thr Met 

135 140 145 150 

agg aat gtt cct gtg ggg gca gga cgt cgt aag aac aaa age tea tct 

Arg Asn Val Pro Val Gly Ala Gly Arg Arg Lys Asn Lys Ser Ser Ser 

155 160 165 

tct cat tac cgt cac ate act att tec gag get ctt gag get gcg agg 

Ser His Tyr Arg His He Thr lie Ser Glu Ala Leu Glu Ala Ala Arg 

170 175 180 

ctt gac ccg ggc tta cag gca aac aca agg gtc ttg agt ttt ggt etc 

Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg Val Leu Ser Phe Gly Leu 

185 190 195 

gaa get cag cag cag cac gtt get get ccc atg aca cct gtt atg aag 

Glu Ala Gin Gin Gin His Val Ala Ala Pro Met Thr Pro Val Met Lys 

200 205 210 

eta caa gaa gat caa aag gtc tea aac ggt get agg aac agg ttt cac 

Leu Gin Glu Asp Gin Lys Val Ser Asn Gly Ala Arg Asn Arg Phe His 

215 220 225 230 

ggg tta gcg gat caa egg ctt gta get egg gta gag aat gga gat gat 

Gly Leu Ala Asp Gin Arg Leu Val Ala Arg Val Glu Asn Gly Asp Asp 

235 240 245 

tgc tea age gga tec tct gtg acc acc tct aac aat cac tea gtg gat 

Cys Ser Ser Gly Ser Ser Val Thr Thr Ser Asn Asn His Ser Val Asp 

250 255 260 

gaa tea aga gca caa age ggc agt gtt gtt gaa gca caa atg aac aac 

Glu Ser Arg Ala Gin Ser Gly Ser Val Val Glu Ala Gin Met Asn Asn 

265 270 275 

aac aac aac aat aac atg aat ggt tat get tgc ate cca ggt gtt cca 

Asn Asn Asn' Asn Asn Met Asn Gly Tyr Ala Cys He Pro Gly Val Pro 

280 285 290 

tgg cct tac acg tgg aat cca gcg atg cct cca cca ggt ttt tac ccg 

Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro Pro Pro Gly Phe Tyr Pro 

295 300 305 310 

cct cca ggg tat cca atg ccg ttt tac cct tac tgg acc ate cca atg 

Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro Tyr Trp Thr He Pro Met 

315 320 325 

eta cca ccg cat caa tec tea teg cct ata age caa aag tgt tea aat 

Leu Pro Pro His Gin Ser Ser Ser Pro He Ser Gin Lys Cys Ser Asn 

330 335 340 

aca aac tct ccg act etc gga aag cat ccg aga gat gaa gga tea teg 

Thr Asn Ser Pro Thr Leu Gly Lys His Pro Arg Asp Glu Gly Ser Ser 

345 350 355 

aaa aag gac aat gag aca gag cga aaa cag aag gee ggg tgc gtt ctg 

Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin Lys Ala Gly Cys Val Leu 

360 365 370 

gtc ccg aaa acg ttg aga ata gat gat cct aac gaa gca gca aag age 

Val Pro Lys Thr Leu Arg He Asp Asp Pro Asn Glu Ala Ala Lys Ser 

375 380 385 390 

teg ata tgg aca aca ttg gga ate aag aac gag gcg atg tgc aaa gee 

Ser He Trp Thr Thr Leu Gly He Lys Asn Glu Ala Met Cys Lys Ala 

395 400 405 

ggt 99^ atg ttc aaa ggg ttt gat cat aag aca aag atg tat aac aac 

Gly Gly Met Phe Lys Gly Phe Asp His Lys Thr Lys Met Tyr Asn Asn 

Page 32 



499 



547 



595 



643 



691 



739 



787 



835 



883 



931 



979 



1027 



1075 



1123 



1171 



1219 



1267 



1315 
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A^cL 420 
410 415 

s s b = is s e s a e ss s e e s 

425 430 



tea aga tea cac aat ttc cat gaa cag att tag agttacatat gtatatgtat 1464 
Ser A?g sir His Asn Phe His Glu Gin He 

445 



atatgtatga ttgattgtat gtatagatga tactggagaa tgatgagttt ttgagaatca X524 

aactcttttc ttctttctag tgattgeett tattccttta catgttttgg ttctctgtac 1584 

actatttgat ttaccttttt tactttcttt cttcatttgt caggaaatgt tggaagataa 1644 

cattaatggt aaaaagttgg tgtggaccgt tgttgcgttg gcatttcaaa aaaaaaaaaa 1704 



aaa 



<210> 20 
<211> 448 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 20 

„et Met Met Glu Thr Arg Asp Pro Ala lie Lys Leu Phe Gly Met Lys 

1 5 10 

ne Pro Phe Pro Ser Val Phe Glu Ser Ala Val Thr Val Glu Asp Asp 
20 25 

Glu Glu Asp Asp Trp Ser Gly Gly Asp Asp Lys Ser Pro Glu Lys Val 



35 



Thr Pro Glu Leu Ser Asp Lys Asn Asn Asn Asn Cys Asn Asp Asn Ser 
50 55 

Phe Asn Asn Ser Lys Pro Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser 
65 70 

Thr Asp Gin lie Glu Ser Ser Asp Thr Pro Glu Asp Asn Gin Gin Thr 
85 90 

Thr Pro Asp Gly Lys Thr Leu Lys Lys Pro Thr Lys He Leu Pro Cys 
100 

Pro Arg Cys Lys Ser Met Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr 
115 120 

As „ He Asn Gin Pro Arg His Phe Cys Lys Ala Cys Gin Arg Tyr Trp 
130 135 

Thr Ala Gly Gly Thr Met Arg Asn Val Pro Val Gly Ala Gly Arg Arg 

Lys Asn Lys Ser Ser Ser Ser His Tyr Arg His He Thr lie Ser Glu 
165 

Al. Leu Glu Ala Ala Arg Leu Asp Pro Gly Leu Gin Ala M Thr Arg 
180 18S 
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Val Leu Ser Phe Gly Leu Glu Ala Gin Gin Gin His Val Ala Ala Pro 
195 200 205 

Met Thr Pro Val Met Lys Leu Gin Glu Asp Gin Lys Val Ser Asn Gly 
210 215 220 

Ala Arg Asn Arg Phe His Gly Leu Ala Asp Gin Arg Leu Val Ala Arg 
225 " 230 235 240 

Val Glu Asn Gly Asp Asp Cys Ser Ser Gly Ser Ser Val Thr Thr Ser 
245 250 255 

Asn Asn His Ser Val Asp Glu Ser Arg Ala Gin Ser Gly Ser Val Val 
260 265 270 

Glu Ala Gin Met Asn Asn Asn Asn Asn Asn Asn Met Asn Gly Tyr Ala 
275 280 285 

Cys He Pro Gly Val Pro Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro 
290 295 300 

Pro Pro Gly Phe Tyr Pro Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro 
305 310 315 320 

Tyr Trp Thr He Pro Met Leu Pro Pro His Gin Ser Ser Ser Pro He 
325 330 335 

Ser Gin Lys Cys Ser Asn Thr Asn Ser Pro Thr Leu Gly Lys His Pro 
340 345 350 

Arg Asp Glu Gly Ser Ser Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin 
355 360 365 

Lys Ala Gly Cys Val Leu Val Pro Lys Thr Leu Arg He Asp Asp Pro 
370 375 380 



Asn Glu Ala Ala Lys Ser Ser lie Trp Thr Thr Leu Gly He Lys Asn 
385 390 395 400 

Glu Ala Met Cys Lys Ala Gly Gly Met Phe Lys Gly Phe Asp His Lys 
405 410 415 

Thr Lys Met Tyr Asn Asn Asp Lys Ala Glu Asn Ser Pro Val Leu Ser 
420 425 430 

Ala Asn Pro Ala Ala Leu Ser Arg Ser His Asn Phe His Glu Gin lie 

440 445 





435 


<210> 


21 


<211> 


1149 


<212> 


. DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (1149) 


<223> 


G431 
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96 



144 



192 



BBS ss ss s s s s sr. s a is SS K 1?? 

e £ ss is Is !s a a s » a » » » s 555 
s s s is sr. ss sr. sr. ss sr. e sr. sr. sr. ss sr. 
«. ,» 1L r K « E « ss is sr. sr. ss ss ss ss 

Gin Glu His Asp Gly Tyr Aia iyr 6Q 
50 

£ SS SS SS SS SS SS E ss ss ss s s s ss s: 

70 

e K ss ss ss ss ss ss ss ss ss ss ss ss ss ss 
s s ss sr. a ss is ss e k « ss s k ss sis 

1()0 1UD 

s ss ss ss ss ss ss ss ss a s ss ffi sss is ss 
ss sr. !s ss ss ss ss ss ss ss e ss ss ss ss ?ss 
b sis SS SS IS SS S3 ss sss s § ss ss ss ss ss 
B k sr. B ss is SS S! B ss s a? s s ss ss 

165 

e SS II? ss ss ss ss ss ss ss ss ss ss ss ss ss 
ss ss is ss ss ss ss ss ss ss ss ss ss a is ss 

195 20 
ctt caa cgt gtc gag tgt caa ttc aaa tec etc tct eta tec tea 
Leu Gin Arg Val Glu Cya Gin Phe Lye se ^ 

210 215 t aafc 

ss ss ss ss ss ss ss; ss ss ss ss ss ss ss s» - 

225 230 

8 SS SS SS SS SS «5 8 S! g SS SS SS SS IS SS 

1 245 

ss ss ss ss ss ss ss ss 5 ss ss ss si ss ss ss 

260 ^ 864 

is ss ss si? ss ss s ss ss ss si ss ss ss SS 5. 

, Ill ~ - — ~ « " 199 " * 512 



576 



624 



672 



720 



816 
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Gly Lys Leu Pro Lys Glu Ala Arg Gin Gin Leu Leu Asp Trp Trp Ser 
290 295 300 

cgt cac tac aaa tgg cct tac cct teg gag caa caa aag etc gec ctt 960 
Arg His Tyr Lys Trp Pro Tyr Pro Ser Glu Gin Gin Lys Leu Ala Leu 
305 "310 . ( 315 320 

gcg gaa tea acg ggg ctg gac cag aaa cag ata aac aat tgg ttc ata 1008 
Ala Glu Ser Thr Gly Leu Asp Gin Lys Gin lie Asn Asn Trp Phe lie 
325 330 335 

aac cag agg aaa egg cat tgg aag ccg teg gag gac atg cag ttt gta 1056 
Asn Gin Arg Lys Arg His Trp Lys Pro Ser Glu Asp Met Gin Phe Val 
340 345 350 

gta atg gac gca aca cat cct cac cat tac ttc atg gat aat gtc ttg 1104 
Val Met Asp Ala Thr His Pro His His Tyr Phe Met Asp Asn Val Leu 
355 360 365 

gac aat cct ttc cca atg gat cac ate tec tec ace atg ctt tga 1149 
Asp Asn Pro Phe Pro Met Asp His lie Ser Ser Thr Met Leu 
370 375 . 380 

<210> 22 

<211> 382 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 22 

Met Glu Ser Gly Ser Asn Ser Thr Ser Cys Pro Met Ala Phe Ala Gly 
1 5 10 15 

Asp Asn Ser Asp Gly Pro Met Cys Pro Met Met Met Met Met Pro Pro 
20 25 30 

He Met Thr Ser His Gin His His Gly His Asp His Gin His Gin Gin 
35 40 45 

Gin Glu His Asp Gly Tyr Ala Tyr Gin Ser His His Gin Gin Ser Ser 
50 55 60 

Ser Leu Phe Leu Gin Ser Leu Ala Pro Pro Gin Gly Thr Lys Asn Lys 
65 70 75 80 

Val Ala Ser Ser Ser Ser Pro Ser Ser Cys Ala Pro Ala Tyr Ser Leu 
85 90 95 

Met Glu He His His Asn Glu He Val Ala Gly Gly He Asn Pro Cys 
100 105 110 

Ser Ser Phe Ser Ser Ser Ala Ser Val Lys Ala Lys He Met Ala His 
115 120 125 

Pro His Tyr His Arg Leu Leu Ala Ala Tyr Val Asn Cys Gin Lys Val 
130 135 140 

Gly Ala Pro Pro Glu Val Val Ala Arg Leu Glu Glu Ala Cys Ser Ser 
145' 150 155 160 

Ala Ala Ala Ala Ala Ala Ser Met Gly Pro Thr Gly Cys Leu Gly Glu 
165 170 175 
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ASP Pro Gly Leu Asp «. «» Glu Ma Tyr Cys Glu Met 

180 

Le „ ou « « - cv- £ - - - S - S " ~ "° 

210 

^ s „ phe « « ffi ■» « « III ~ *" " " S 
225 

01y « s« «. «. « « ~ « s - - ~ v " a K ° 

„ Ala 0* »P « - - - » " *" * *" " 

260 

Gly v „ - - - a ~ » - - si - lv " 

275 

. nu Ala Arg Gin Gin Leu Leu Asp Trp Trp Ser 
Gly Lys Leu Pro Lys Glu Ala Arg » 300 
290 

, t™ Pro Tyr Pro Ser Glu Gin Gin Lys Leu Ala Leu 
. Arg His Tyr Lys Trp Pro Tyr * n5 

305 

M «. « «. -j - « - - a "° A= " » ? 

nw, «5er Glu Asp Met Gin Phe Val 
Asn Gin Arg Lys Arg His Trp Lys Pro Ser 

340 

t^o Hie Tvr Phe Met Asp Asn Val Leu 
Val Met Asp Ala Thr His Pro Hxs His Tyr ^ 
355 

u^o tip Ser Ser Thr Met Leu 
ASP Asn Pro Phe Pro Met Asp Hxs lie Ser ^ 



PCT/US00/31325 



<210> 23 

<211> 1136 

fan: Arabidopsis thaliana 
<220> 

<221> CDS 

*222> (118).- (1074) 

<223> G187 



ossc^. ««— ta * tccoc " cet " ctc,t 

— — • — ' ~rrr. 

a sr. e s a « - £ £ s e e e u 
is e a k a a s a - - s " 55 f s ffi 



60 
117 
165 



20 
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tat gga aac etc cca tea caa aac ggt ttt aat cca tct act tat tec 
Tyr Gly Asn Leu Pro Ser Gin Asn Gly Phe Asn Pro Ser Thr Tyr Ser 
35 40 45 

ttc ace gat tgt etc caa agt tct cca gca gcg tat gaa tct eta ctt 
Phe Thr Asp Cys Leu Gin Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 
50 55 60 

cag aaa act ttt ggt ctt tct ccc tct tec tea gag gtt ttc aat tct 
Gin Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 
65 70 75 80 

teg ate gat caa gaa ccg aac cgt gat gtt act aat gac gta ate aat 
Ser lie Asp Gin Glu Pro Asn Arg Asp Val Thr Asn Asp Val lie Asn 
85 90 95 



gag aga teg ttc caa gat cca acg gtt gtg att aca act tac gag ggt 
Glu Arg Ser Phe Gin Asp Pro Thr Val Val lie Thr Thr Tyr Glu Gly 
210 215 220 



get get atg ttc tec gca gac etc atg act cca aga age ttt gca cat 
Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 
245 250 255 



261 



309 



357 



405 



g9t ggt gca tgc aac gag act gaa act agg gtt tct cct tct aat tct 4 53 

Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 
100 105 110 

tec tct agt gag get gat cac ccc ggt gaa gat tec ggt aag age egg 501 

Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 
115 120 125 

agg aaa cga gag tta gtc ggt gaa gaa gat caa att tec aaa aaa gtt 549 

Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gin He Ser Lys Lys Val 
130 135 140 

ggg aaa acg aaa aag act gag gtg aag aaa caa aga gag cca cga gtc 597 

Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gin Arg Glu Pro Arg Val 

145 150 155 160 

teg ttt atg act aaa agt gaa gtt gat cat ctt gaa gat ggt tat aga 645 

Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 

165 170 175 

tgg aga aaa tac ggc caa aag get gta aaa aat age cct tat cca agg 693 

Trp Arg Lys Tyr Gly Gin Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 
180 185 190 

agt tac tat aga tgt aca aca caa aag tgc aac gtg aag aaa cga gtg 741 

Ser Tyr Tyr Arg Cys Thr Thr Gin Lys Cys Asn Val Lys Lys Arg Val 
195 200 205 



789 



caa cac aac cac ccg att ccg act aat ctt cga gga agt tct gee gcg 837 
Gin His Asn His Pro He Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala 
225 230 235 240 



885 



gat atg ttt agg acg gca get tat act aac ggc ggt tct gtg gcg gcg 933 

Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 
260 265 270 

get ttg gat tat gga tat gga caa agt ggt tat ggt agt gtg aat tea 981 

Ala Leu Asp Tyr Gly Tyr Gly Gin Ser Gly Tyr Gly Ser Val Asn Ser 
275 280 285 

aac cct agt tct cac caa gtg tat cat caa ggg ggt gag tat gag etc 1029 

Asn Pro Ser Ser His Gin Val Tyr His Gin Gly Gly Glu Tyr Glu Leu 
290 295 300 

ttg agg gag att ttt cct tea att ttc ttt aag caa gag cct tga 1074 

Leu Arg Glu He Phe Pro Ser He Phe Phe Lys Gin Glu Pro 
305 310 315 

tcgatcattg ttataactac atatattata tatattgaga gagagaggta gagaaaaaaa 1134 
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1136 



aa 




<210> 
<211> 
<212> 
<213> 


24 

318 

PRT 

Arabidopsis thaliana 


<400> 


24 


Met Ser Asn Glu Thr Arg Asp 
1 5 



10 15 



Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser Asn Pro Ser Ser 
20 25 

Tyr Gly Asn Leu Pro Ser Gin Asn Gly Phe Asn Pro Ser Thr Tyr Ser 

Phe Thr Asp cys Leu Gin Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 
50 55 

Gin Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 



65 



Ser 



70 



lie Asp Gin Glu Pro Asn Arg Asp Val Thr Asn Asp Val lie Asn 



85 



Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 
100 105 

Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 
115 120 

Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gin lie Ser Lys Lys Val 
130 135 

Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gin Arg Glu Pro Arg Val 
145 150 

Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 

Trp Arg Lys Tyr Gly Gin Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 

Ser Tyr Tyr Arg Cys Thr Thr Gin Lys Cys Asn Val Lys Lys Arg Val 
195 200 

Glu Arg Ser Phe Gin Asp Pro Thr Val Val He Thr Thr Tyr Glu Gly 
210 215 

Gin His Asn His Pro lie Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala 
225 230 

Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 



245 



A8 p Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 
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260 265 270 

Ala Leu Asp Tyr Gly Tyr Gly Gin Ser Gly Tyr Gly Ser Val Asn Ser 
275 280 285 

Asn Pro Ser Ser His Gin Val Tyr His Gin Gly Gly Glu Tyr Glu Leu 
290 295 300 

Leu Arg Glu lie Phe Pro Ser lie Phe Phe Lys Gin Glu Pro 
305 310 315 



48 



96 



<210> 25 

<211> 2580 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1)..(2580) 

<223> G470 

<400> 25 

atg gcg agt teg gag gtt tea atg aaa ggt aat cgt gga gga gat aac 

Met Ala Ser Ser Glu Val Ser Met Lys Gly Asn Arg Gly Gly Asp Asn 
1 5 10 15 

ttc tec tec tct ggt ttt agt gac cct aag gag act aga aat gtc tec 
Phe Ser Ser Ser Gly Phe Ser Asp Pro Lys Glu Thr Arg Asn Val Ser 
20 25 30 

gtc gec ggc gag ggg caa aaa agt aat tct ace cga tec get gcg get 144 
Val Ala Gly Glu Gly Gin Lys Ser Asn Ser Thr Arg Ser Ala Ala Ala 
35 40 45 

gag cgt get ttg gac cct gag get get ctt tac aga gag eta tgg cac 192 
Glu Arg Ala Leu Asp Pro Glu Ala Ala Leu Tyr Arg Glu Leu Trp His 
50 55 60 

get tgt get ggt ccg ctt gtg acg gtt cct aga caa gac gac cga gtc 240 
Ala Cys Ala Gly Pro Leu Val Thr Val Pro Arg Gin Asp Asp Arg Val 
65 70 75 80 

ttc tat ttt cct caa gga cac ate gag cag gtg gag get teg acg aac 288 
Phe Tyr Phe Pro Gin Gly His He Glu Gin Val Glu Ala Ser Thr Asn 
85 90 95 

cag gcg gca gaa caa cag atg cct etc tat gat ctt ccg tea aag ctt 336 
Gin Ala Ala Glu Gin Gin Met Pro Leu Tyr Asp Leu Pro Ser Lys Leu 
100 105 HO 

etc tgt cga gtt att aat gta gat tta aag gca gag gca gat aca gat 3 84 

Leu Cys Arg Val He Asn Val Asp Leu Lys Ala Glu Ala Asp Thr Asp 
115 120 125 

gaa gtt tat gcg cag att act ctt ctt cct gag get aat caa gac gag 432 
Glu Val Tyr Ala Gin He Thr Leu Leu Pro Glu Ala Asn Gin Asp Glu 
130 135 140 

aat gca att gag aaa gaa gcg cct ctt cct cca cct ccg agg ttc cag 480 
Asn Ala He Glu Lys Glu Ala Pro Leu Pro Pro Pro Pro Arg Phe Gin 
145 150 155 160 

gtg cat teg ttc tgc aaa acc ttg act gca tec gac aca agt aca cat 
Val His Ser Phe Cys Lys Thr Leu Thr Ala Ser Asp Thr Ser Thr His 
165 170 175 

ggt gga ttt tct gtt ctt agg cga cat gcg gat gaa tgt etc cca cct 576 
Gly Gly Phe Ser Val Leu Arg Arg His Ala Asp Glu Cys Leu Pro Pro 
180 185 190 



528 
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s ts a is si is is s s l " « I E s K 



195 200 



672 



720 



768 



816 



664 



912 



960 



1008 
1056 



a a k a e a a as s s a a si a a is 

210 215 

13 a si a a is k » 5S & « - « « s i 
a s « e a e k a s is a a si b £ k 

245 250 

b a a e a e a si g a a a s £ = « 

260 265 

a is a a a s a as a s s s $ e a k 

275 

s s a a s as s s w s sa is s e a a 

290 295 

a a a s a a a a s s g » s sa jg 

305 310 

s as s e a a s £ s s « » « s a a 

325 330 

a E a a a k s is b a a - s a s s 

340 34i> 

„ Mt cct act aqg tgg cca aaa tea aag tgg HO* 
$3 if? 555 l?u §S £ S fro K S S jg Ser Lys Trp 
355 360 

a a a a a a si s ss s - i - - ■« s 

370 375 

a a e is a a a «a a a a a s is is e 

385 390 

a a a a a a a a is a a a a a a a 

405 41U 

a a a a a a is a a a a a c £ s s 

420 42:> 

a s a a a is a a a a a si a a e si 
a a a a a s c a a s a ss a e ss a 

450 455 

a a a a a a a a a a a a a a a a 

465 470 

a a a e a « is a a a a a a a is is 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 
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aac tgg atg tec tea gec agg cat gaa cct act tac aca gat ttg etc 1536 
Asn Trp Met Ser Ser Ala Arg His Glu Pro Thr Tyr Thr Asp Leu Leu 
SO0 505 510 

tec ggc ttt ggg act aac ata gat cca tec cat ggt cag egg ata cct 1584 
Ser Glv Phe Gly Thr Asn He Asp Pro Ser His Gly Gin Arg He Pro 
r--.^ tzor\ <?5«; 



515 520 525 

ttt tat gac cat tea tea tea cct tct atg cct gca aag aga ate ttg 

Phe Tyr Asp His Ser Ser Ser Pro Ser Met Pro Ala Lys Arg He Leu 

530 535 540 

agt gat tea gaa ggc aag ttc gat tat ctt get aac cag tgg cag atg 

Ser Asp Ser Glu Gly Lys Phe Asp Tyr Leu Ala Asn Gin Trp Gin Met 

545 550 555 560 



1632 



1680 



1920 



ata cac tct ggt etc tec ctg aag tta cat gaa tct cct aag gta cct 1728 

He His Ser Gly Leu Ser Leu Lys Leu His Glu Ser Pro Lys Val Pro 
565 570 575 

gca gca act gat gcg tct etc caa ggg cga tgc aat gtt aaa tac age 1776 

Ala Ala Thr Asp Ala Ser Leu Gin Gly Arg Cys Asn Val Lys Tyr Ser 
580 585 590 

gaa tat cct gtt ctt aat ggt eta teg act gag aat get ggt ggt aac 1824 

Glu Tyr Pro Val Leu Asn Gly Leu Ser Thr Glu Asn Ala Gly Gly Asn 
595 600 605 

tgg cca ata cgt cca cgt get- ttg aat tat tat gag gaa gtg gtc aat 1872 

Trp Pro He Arg Pro Arg Ala Leu Asn Tyr Tyr Glu Glu Val Val Asn 

610 615 620 

get caa gcg caa get cag get agg gag caa gta aca aaa caa ccc ttc 

Ala Gin Ala Gin Ala Gin Ala Arg Glu Gin Val Thr Lys Gin Pro Phe 

625 630 635 640 

acg ata caa gag gag aca gca aag tea aga gaa ggg aac tgc agg etc 1968 

Thr He Gin Glu Glu Thr Ala Lys Ser Arg Glu Gly Asn Cys Arg Leu 
645 650 655 

ttt ggc att cct ctg ace aac aac atg aat ggg aca gac tea acc atg 2016 

Phe Gly He Pro Leu Thr Asn Asn Met Asn Gly Thr Asp Ser Thr Met 
660 665 670 

tct cag aga aac aac ttg aat gat get gcg ggg ctt aca cag ata gca 2064 

Ser Gin Arg Asn Asn Leu Asn Asp Ala Ala Gly Leu Thr Gin He Ala 
675 680 685 

tea cca aag gtt cag gac ctt tea gat cag tea aaa ggg tea aaa tea 2112 

Ser Pro Lys Val Gin Asp Leu Ser Asp Gin Ser Lys Gly Ser Lys Ser 

690 695 700 

aca aac gat cat cgt gaa cag gga aga cca ttc cag act aat aat cct 2160 

Thr Asn Asp His Arg Glu Gin Gly Arg Pro Phe Gin Thr Asn Asn Pro 

705 710 715 720 

cat ecg aag gat get caa acg aaa acc aac tea agt agg agt tgc aca 2208 

His Pro Lys Asp Ala Gin Thr Lys Thr Asn Ser Ser Arg Ser Cys Thr 
725 " 730 735 

aag gtt cac aag cag gga att gca ctt ggc cgt tea gtg gat ctt tea 

Lys val His Lys Gin Gly He Ala Leu Gly Arg Ser Val Asp Leu Ser 
740 745 750 

aag ttc caa aac tat gag gag tta gtc get gag ctg gac agg ctg ttt 

Lys Phe Gin Asn Tyr Glu Glu Leu Val Ala Glu Leu Asp Arg Leu Phe 

755 * 760 765 

gag ttc aat gga gag ttg atg get cct aag aaa gat tgg ttg ata gtt 2352 

Glu Phe Asn Gly Glu Leu Met Ala Pro Lys Lys Asp Trp Leu He Val 

770 775 780 

tac aca gat gaa gag aat gat atg atg ctt gtt ggt gac gat cct tgg 2400 

Tyr Thr Asp Glu Glu Asn Asp Met Met Leu Val Gly Asp Asp Pro Trp 
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785 



790 
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795 



800 



SS SS SB S5 SS 3 6 31 5 S pS S 5 S £ K 

805 810 

ss a 3 w s = a if? s a s ss a s is is 

820 825 

is ss c is ii? is it; s e e k s k a. s s 

835 840 

tea aat cct tea ttg tec age get ggg aac tct taa 
Ser Asn Pro Ser Leu Ser Ser Ala Gly Asn Ser 
850 855 



<210> 26 

<211> 859 

<212> PRT 

<213> Arabidopsis thalxana 

<400> 26 

Met Ala Ser Ser Glu Val Ser Met Lys Gly Asn Ar 9 Gly Gly Asp Asn 
1 5 10 



PCT/US00/31325 



2448 



Phe Ser Ser Ser Gly Phe Ser Asp Pro Lys Glu Thr Arg Asn Val Ser 
20 25 

val Ala Gly Glu Gly Gin Lys Ser Asn Ser Thr Arg Ser Ala Ala Ala 
35 40 

Glu Arg Ala Leu Asp Pro Glu Ala Ala Leu Tyr Arg Glu Leu Trp His 
50 55 

Ala Cys Ala Gly Pro Leu Val Thr Val Pro Arg Gin Asp Asp Arg Val 
65 70 S 

Phe Tyr Phe Pro Gin Gly His lie Glu Gin Val Glu Ala Ser Thr Asn 
85 90 

Gin Ala Ala Glu Gin Gin Met Pro Leu Tyr Asp Leu Pro Ser Lys Leu 



100 



Leu Cys Arg 
115 



Glu Val Tyr Ala 
130 



val lie Asn Val Asp Leu Lys Ala Glu Ala Asp Thr Asp 



120 



Gin lie Thr Leu Leu Pro Glu Ala Asn Gin Asp Glu 

140 



135 



A8 n Ala lie Glu Lys Glu Ala Pro Leu Pro Pro Pro Pro Arg Phe Gin 
145 150 

Val His Ser Phe Cys Lys Thr Leu Thr Ala Ser Asp Thr Ser Thr His 



165 



Gly Gly Phe Ser 



Val Leu Arg Arg His Ala Asp Glu Cys Leu Pro Pro 



180 



185 



^ Asp Met Ser Arg Gin Pro Pro Thr Gin Glu Leu Val Ala Lys Asp 
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195 200 205 

Leu His Ala Asn Glu Trp Arg Phe Arg His He Phe Arg Gly Gin Pro 
210 215 220 

Arq Arq His Leu Leu Gin Ser Gly Trp Ser Val Phe Val Ser Ser Lys 
225 230 235 240 

Arg Leu Val Ala Gly Asp Ala Phe He Phe Leu Arg Gly Glu Asn Gly 
245 250 255 

Glu Leu Arg Val Gly Val Arg Arg Ala Met Arg Gin Gin Gly Asn val 
260 265 270 

Pro Ser Ser Val He Ser Ser His Ser Met His Leu Gly Val Leu Ala 
275 280 285 

Thr Ala Trp His Ala He Ser Thr Gly Thr Met Phe Thr Val Tyr Tyr 
290 295 300 

Lys Pro Arg Thr Ser Pro Ser Glu Phe He Val Pro Phe Asp Gin Tyr 
305 310 315 320 

Met Glu Ser Val Lys Asn Asn Tyr Ser lie Gly Met Arg Phe Lys Met 
325 330 335 

Arg Phe Glu Gly Glu Glu Ala Pro Glu Gin Arg Phe Thr Gly Thr He 
340 345 350 

Val Gly He Glu Glu Ser Asp Pro Thr Arg Trp Pro Lys Ser Lys Trp 
355 360 365 

Arg Ser Leu Lys Val Arg Trp Asp Glu Thr Ser Ser He Pro Arg Pro 
370 375 380 

Asp Arg Val Ser Pro Trp Lys Val Glu Pro Ala Leu Ala Pro Pro Ala 
385 390 395 400 



Leu Ser Pro Val Pro Met Pro Arg Pro Lys Arg Pro Arg Ser Asn He 
405 410 415 

Ala Pro Ser Ser Pro Asp Ser Ser Met Leu Thr Arg Glu Gly Thr Thr 
420 425 430 

Lys Ala Asn Met Asp Pro Leu Pro Ala Ser Gly Leu Ser Arg Val Leu 
435 440 445 

Gin Gly Gin Glu Tyr Ser Thr Leu Arg Thr Lys His Thr Glu Ser Val 
450 455 460 

Glu Cys Asp Ala Pro Glu Asn Ser Val Val Trp Gin Ser Ser Ala Asp 
465 470 475 480 



Asp Asp Lys Val Asp Val Val Ser Gly Ser Arg Arg Tyr Gly Ser Glu 
48 f 490 495 
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Asn Trp Met Ser Ser Ala Arg His Glu Pro Thr Tyr Thr Asp Leu Leu 
500 505 510 

Ser Gly Phe Gly Thr Asn lie Asp Pro Ser His Gly Gin Arg lie Pro 
515 520 55 

Phe Tyr Asp His Ser Ser Ser Pro Ser Met Pro Ala Lys Arg lie Leu 
530 535 

Ser Asp Ser Glu Gly Lys Phe Asp Tyr Leu Ala Asn Gin Trp Gin Met 
545 550 

lie His Ser Gly Leu Ser Leu Lys Leu His Glu Ser Pro Lys Val Pro 

565 570 

Ala Ala Thr Asp Ala Ser Leu Gin Gly Arg Cys Asn Val Lys Tyr Ser 
580 585 3 

Glu Tyr Pro Val Leu Asn Gly Leu Ser Thr Glu Asn Ala Gly Gly Asn 
595 600 60i 

Trp Pro lie Arg Pro Arg Ala Leu Asn Tyr Tyr Glu Glu Val Val Asn 

610 « 15 
Ala Gin Ala Gin Ala Gin Ala Arg Glu Gin Val Thr Lys Gin Pro Phe 



625 



630 



Thr lie Gin Glu Glu Thr Ala Lys Ser Arg Glu Gly Asn Cys Arg Leu . 
645 650 

Phe Gly He Pro Leu Thr Asn Asn Met Asn Gly Thr Asp Ser Thr Met 
660 665 

Ser Gin Arg Asn Asn Leu Asn Asp Ala Ala Gly Leu Thr Gin lie Ala 
675 680 68b 

Ser Pro Lys Val Gin Asp Leu Ser Asp Gin Ser Lys Gly Ser Lys Ser 
690 695 700 

Thr Asn Asp His Arg Glu Gin Gly Arg Pro Phe Gin Thr Asn Asn Pro 

705 710 

His Pro Lys Asp Ala Gin Thr Lys Thr Asn Ser Ser Arg Ser Cys Thr 
725 '30 /JJ 

Lys Val His Lys Gin Gly He Ala Leu Gly Arg Ser Val Asp Leu Ser 
740 745 

Lys Phe Gin Asn Tyr Glu Glu Leu Val Ala Glu Leu Asp Arg Leu Phe 
755 760 7&;> 

Glu Phe Asn Gly Glu Leu Met Ala Pro Lys Lys Asp Trp Leu He Val 
770 775 

Tyr Thr Asp Glu Glu Asn Asp Met Met Leu Val Gly Asp Asp Pro Trp 
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Gin Glu Phe Cys Cys Met Val Arg Lys He Phe He Tyr Thr Lys Glu 
805 810 815 

Glu Val Arg Lys Met Asn Pro Gly Thr Leu Ser Cys Arg Ser Glu Glu 
820 825 830 

Glu Ala Val Val Gly Glu Gly Ser Asp Ala Lys Asp Ala Lys Ser Ala 
835 840 845 

Ser Asn Pro Ser Leu Ser Ser Ala Gly Asn Ser 
850 855 



<210> 


27 


<211> 


1519 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(197) . . (1252) 


<223> 


G615 


<400> 


27 



ttttttcttt tctttctttt tttgctggtg tgagaaattg tacgcttact atctctctct 60 

ctctctgcca gattctctct ttttgatgat gtgaaagttg tgcttttgtt tcttaagaaa 120 

aaggcatatt tttaatactt gattcttggt tcttgattct tgattcttgg ttttttttag 180 

cttcttaagt tcggtg atg teg tct tec ace aat gac tac aac gat ggt aat 232 
Met Ser Ser Ser Thr Asn Asp Tyr Asn Asp Gly Asn 
15 10 



aac aat gga gtg tac cct etc tct ctt tac ctt tct tea etc tct ggc 
Asn Asn Gly Val Tyr Pro Leu Ser Leu Tyr Leu Ser Ser Leu Ser Gly 
15 20 25 



280 



cat caa gac ate att cat aat ccc tac aac cat cag tta aaa gca tct 328 

His Gin Asp He He His Asn Pro Tyr Asn His Gin Leu . Lys Ala Ser 
30 35 40 

ccg ggc cat atg gta tea gca gtt cct gaa tct ctg ate gat tac atg 376 

Pro Gly His Met Val Ser Ala Val Pro Glu Ser Leu He Asp Tyr Met 

45 50 55 60 

gcg ttt aag tea aat aat gtt gtg aat caa caa ggc ttt gag ttt cct 424 

Ala Phe Lys Ser Asn Asn Val Val Asn Gin Gin Gly Phe Glu Phe Pro 
65 70 75 

gag gtg tea aag gaa ate aag aag gtg gtg aag aag gac cga cat age 472 

Glu Val Ser Lys Glu He Lys Lys Val Val Lys Lys Asp Arg His Ser 
80 85 90 

aag att caa acg gca caa ggg att aga gac agg agg gtt agg ctt ttt 520 

Lys He Gin Thr Ala Gin Gly He Arg Asp Arg Arg Val Arg Leu Phe 
95 100 105 

att ggg att get cgc caa ttc ttt gat ctt cag gat atg ttg ggg ttt 568 

He Gly He Ala Arg Gin Phe Phe Asp Leu Gin Asp Met Leu Gly Phe 
110 115 120 

gat aaa get agt aaa acg tta gac tgg ctg etc aag aag tea aga aaa 616 

Asp LyB Ala Ser Lys Thr Leu Asp Trp Leu Leu Lys Lys Ser Arg Lys 

125 130 135 140 

gee ate aaa gag gtc gta caa gca aaa aac etc aac aat gat gat gaa 664 

Ala lie Lys Glu Val Val Gin Ala Lys Asn Leu Asn Asn Asp Asp Glu 
145 150 155 
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CT *t- ttt oaa aac att qga ggc gat gta gaa caa gaa gag gag aag gag 
Isp Pne ll? Asn Tie l?y Sy Asp Val Glu Gin Glu Olu Glu Lys Olu 
160 165 

o aa oat aac aat qqc gat aag age ttc gtg tat ggt ttg age ccc ggg 
III lit IS Asn 8? Asp Ly! sir Phe Val Tyr Gly Leu Ser Pro Gly 
175 180 185 

£ E B 1" e o- E EE K S £ K 1?? e E 
SS E S S B H 5 = S S S 55 B S « K 

205 210 215 

aaa act aaa gga aaa gca aag gag cga aca aaa gag atg atg gee tat 

tys III Sg Gly Lys Ala Lys Glu Arg Thr Lys Glu Met Met Ala Tyr 

225 230 

aat aat cca gag act gee tct gat att aca caa tct gaa ate atg gac 
1% Asn Pro Glu Thr Ala Ser Asp lie Thr Gin Ser Glu lie Met Asp 
240 245 

cca ttc aag agg tct ata gtc ttc aat gaa gga gaa gat atg aca cac 
Pro Phe Lys Arg Ser lie Val Phe Asn Glu Gly Glu Asp Met Thr His 
255 260 

S E E W K - S S! 85 E B B S E E E 

270 275 280 

t-t-a acc aat atq act eta cca acg aag atg ggt caa agt tac aat caa 
Leu Thr Asn Me? Thr Leu Pro Thr Lys Met Gly Gin Ser Tyr Asn Gin 
285 2 9° 295 

a *t- » a t- oflfl ata ctt ata ttg gta gat cag agt tct age age aac tat 
Asn Asn l!y He Zeu Me? lei lal Asp Gin Ser Ser Ser Ser Asn Tyr 
305 310 

ffi S is S B E K H B E E SI B S E S 

320 325 

SB Si E E SI £ £ E E E E B B 5S E E 

335 340 

ggt ttc eta taa atctcgacag ttttgaagga etatgeatga tcaagtttaa 
Gly Phe Leu 
350 

acatgtaagc caatatagtc ccttattcct ctgaatgtat acaaaatcta tagttatgta 
tatctgttcc tttttaacgt atctttattg atcttctgtg ccttgatcaa aattgtcatt 
ttaagattca gtttgtgtaa tattttagct acaactttta agtggtatta ttgtaacctt 
ttgaactata tattttgaag atgaataaga acatgtttat ataaaaa 



712 

760 

808 

856 

904 

952 

1000 

1048 

1096 

1144 

1192 

1240 

1292 

1352 
1412 
1472 
1519 



<210> 28 

<211> 351 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 28 

Met Ser Ser Ser Thr Asn Asp Tyr Asn Asp Gly Asn Asn Asn Gly Val 
1 5 10 AD 

Tyr Pro Leu Ser Leu Tyr Leu Ser Ser Leu Ser Gly His Gin Asp lie 
20 25 30 
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He His Asn Pro Tyr Asn His Gin Leu Lys Ala Ser Pro Gly His Met 
35 40 45 

Val Ser Ala Val Pro Glu Ser Leu He Asp Tyr Met Ala Phe Lys Ser 
50 55 60 

Asn Asn Val Val Asn Gin Gin Gly Phe Glu Phe Pro Glu Val Ser Lys 
65 70 75 80 

Glu He Lys Lys Val Val Lys Lys Asp Arg His Ser Lys He Gin Thr 
85 90 95 

Ala Gin Gly He Arg Asp Arg Arg Val Arg Leu Phe He Gly He Ala 
100 105 HO 

Arg Gin Phe Phe Asp Leu Gin Asp Met Leu Gly Phe Asp Lys Ala Ser 
115 120 125 

Lys Thr Leu Asp Trp Leu Leu Lys Lys Ser Arg Lys Ala He Lys Glu 
130 135 140 

Val Val Gin Ala Lys Asn Leu Asn Asn Asp Asp Glu Asp Phe Gly Asn 
145 150 155 160 

He Gly Gly Asp Val Glu Gin Glu Glu Glu Lys Glu Glu Asp Asp Asn 
165 170 175 

Gly Asp Lys Ser Phe Val Tyr Gly Leu Ser Pro Gly Tyr Gly Glu Glu 
180 185 190 

Glu Val Val Cys Glu Ala Thr Lys Ala Gly He Arg Lys Lys Lys Ser 
195 200 205 

Glu Leu Arg Asn lie Ser Ser Lys Gly Leu Gly Ala Lys Ala Arg Gly 
210 215 220 

Lys Ala Lys Glu Arg Thr Lys Glu Met Met Ala Tyr Asp Asn Pro Glu 
225 230 235 240 

Thr Ala Ser Asp lie Thr Gin Ser Glu He Met Asp Pro Phe Lys Arg 
245 250 255 

Ser He Val Phe Asn Glu Gly Glu Asp Met Thr His Leu Phe Tyr Lys 
260 265 270 

Glu Pro He Glu Glu Phe Asp Asn Gin Glu Ser He Leu Thr Asn Met 
275 280 285 

Thr Leu Pro Thr Lys Met Gly Gin Ser Tyr Asn Gin Asn Asn Gly He 
290 ' 295 300 

Leu Met Leu Val Asp Gin Ser Ser Ser Ser Asn Tyr Asn Thr Phe Leu 
305 310 315 320 

Pro Gin Asn Leu Asp Tyr Ser Tyr Asp Gin Asn Pro Phe His Asp Gin 
325 330 335 
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Thr Leu Tyr Val Val Thr Asp Lys Asn Phe Pro Lys Gly Phe Leu 
340 345 



<210> 29 

<211> 974 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (62).. (874) 

<223> G1073 



ccccccgacc tgcctctaca gagacctgaa gattccagaa ccccacctga tcaaaaataa 

■ s a a ss a s e e a be a e « « s 

SS S B B E S B B K S E S B S S B 

20 25 

a a ss s a a a a k a is s = a ss s 

35 40 * s 

a a a a a a a a & a ss « a a ss ss 

55 &u 



a a si a a a is s a a a a k e is s 

65 70 75 

a e a a a a a a a a a a is a a a 

85 90 

a s ss a s a a ss a si a a a a a 

100 105 

a e a a a ss a a a a a a a a ss a 

115 120 J"* 3 

a h a s;s si a b ss si a e a a a s a 

130 135 

a a e a a s a ss e e a a a a ss a 

145 150 ■ L3S 

a a a a a a si e a a a a a a a is 
ss e a a a a a e a b e ss a s ss si 

180 185 

s s s s a « si si as si si e e a a a 



195 



a e ?ss s e a a s si b a n ss ss a ss 



157 



205 



253 



301 



349 



397 



445 



493 



541 



589 



637 



685 



733 



210 215 220 
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ggt gga aat ggt gga gga ggt gtt get ttc tac aat ctt gga atg aat 781 
Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Aen Leu Gly Met Asn 
225 230 * 235 240 

atg aac aat ttt caa ttc tec ggg gga gat att tac ggt atg age ggc 829 
Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 
245 250 255 

ggt age gga gga ggt ggt ggc ggt gcg act aga ccc gcg ttt tag 874 
Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 

agttttagcg ttttggtgac accttttgtt gcgtttgcgt gtttgacctc aaactactag 934 

gctactagct atagcggttg egaaatgega atattaggtt 974 

<210> 30 
<211> 270 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 30 

Met Glu Leu Asn Arg Ser Glu Ala Asp Glu Ala Lys Ala Glu Thr Thr 
15 10 15 

Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

Val Leu Glu Val Thr Ser Gly Ser Asp He Ser Glu Ala Val Ser Thr 
65 70 75 80 

Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys He He Ser Gly Thr Gly 
85 90 95 

Ala Val Thr Asn Val Thr He Arg Gin Pro Ala Ala Pro Ala Gly Gly 
100 105 110 



Gly Val He Thr Leu His Gly Arg Phe Asp He Leu Ser Leu Thr Gly 
115 120 125 

Thr Ala Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 

Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

Ser Leu He Ala Ser Gly. Pro Val Val Leu Met Ala Ala Ser Phe Ala 
165 170 175 

ABn Ala Val Tyr Asp Arg Leu Pro He Glu Glu Glu Glu Thr Pro Pro 
180 185 190 



Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 

Page 50 



PCTAJS00/31325 

WO 01/36444 

MBI0018 Sequence Listing.ST25 

Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 

Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Asn Leu Gly Met Asn 
225 230 

Met Asn Asn Phe Gin Phe Ser Gly Gly Asp He Ty* Gly Met Ser Gly 
245 

Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 



<210> 31 

<211> 2010 

<212> DNA . 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> U)..(2010) 

<223> G1493 



aas a s a a! a a s « a e « f = " 
s is 15 a a a k a « a « s e f* ; « s? " 

20 25 

ss a a is a 3? e a is e a a a a a s 

35 40 

a S3 a a a a a ss a s a. s a e e a 
s: ss its s k e a a a = = a 5 = a 
su a? a a s e a a B £ a a a a a 

IS SS W S S B B 6 S 2 C B B S S S 

100 105 

S B SS B S3 B B IS 5 S O « i S « 3 " 

115 120 

-,5? a a s a b b a a a a a a a a ss 

130 135 

a a a a s a a a a a a a a a e a 

145 150 

s a a b a a a? a a a e a a a a a 



165 

cag cag caa cat aga gga ggt ggt ggt ggt gca get gtt tct ggt gga 
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192 
240 
288 
336 
384 
432 
480 
528 



576 
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Gin Gin Gin His Arg Gly Gly Gly Gly Gly Ala Ala Val Ser Gly Gly 
180 185 190 

gag gat gcg gtg gat gat aac tea tec teg gtt aac gaa ggt aac aat 624 

Glu Asp Ala Val Asp Asp Asn Ser Ser Ser Val Asn Glu Gly Asn Asn 
195 200 205 

tgg agg age agt tea egg aag agg aaa gac gag gaa gga gaa gag caa 672 

Trp Arg Ser Ser Ser Arg Lys Arg Lys Asp Glu Glu Gly Glu Glu Gin 

210 215 220 

gga gat gat aag gac gaa gat gcg teg aat ttg aag aaa ccg cgt gtc 720 

Gly Asp Asp Lys Asp Glu Asp Ala Ser Asn Leu Lys Lys Pro Arg Val 
225 230 235 240 

gtc tgg tct gtt gaa ttg cat cag cag ttt gtt get get gtt aat cag 768 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
245 250 255 

etc ggc gtt gag aag gcg gtt cct aaa aag ate tta gag ctg atg aat 816 

Leu Gly Val Glu Lys Ala Val Pro Lys Lys He Leu Glu Leu Met Asn 
260 265 270 



gtt cct ggt eta acc cga gaa aac gta gca agt cac etc cag aaa tac 
Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys Tyr 
275 ~ 280 285 



864 



egg ata tat eta aga egg ctt gga ggg gta teg cag cac caa ggc aat 912 
Arg He Tyr Leu Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn 
290 295 300 

ctt aac aac teg ttt atg acg ggt cag gat gcg age ttc gga cct ctt 960 
Leu Asn Asn Ser Phe Met Thr Gly Gin Asp Ala Ser Phe Gly Pro Leu 
305 310 315 320 

teg aca ttg aat ggg ttt gat ctt caa gca eta gee gtc aca ggt cag 1008 
Ser Thr Leu Asn Gly Phe Asp Leu Gin Ala Leu Ala Val Thr Gly Gin 
325 330 335 

tta cct gca cag agt ctt gca cag ctt caa gec get ggt tta ggc egg 1056 
Leu Pro Ala Gin Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg 
340 345 350 

cct gcg atg gtc tct aag tea ggt ttg ccg gtt tec tec att gtg gat 1104 
Pro Ala Met Val Ser Lys Ser Gly Leu Pro Val Ser Ser He Val Asp 
355 360 365 

gag aga age ate ttc age ttt gac aac acg aaa aca aga ttt gga gaa 1152 
Glu Arg Ser He Phe Ser Phe Asp Asn Thr Lys Thr Arg Phe Gly Glu 
370 375 380 

ggg ctt ggg cat cac ggg caa caa ccc caa cag caa cca cag atg aac 1200 
Gly Leu Gly His His Gly Gin Gin Pro Gin Gin Gin Pro Gin Met Asn 
385 390 395 400 

tta ctt cac ggt gtc ccc acg ggt tta caa cag cag ctt cct atg ggt 1248 
Leu Leu His Gly Val Pro Thr Gly Leu Gin Gin Gin Leu Pro Met Gly 
405 410 415 

aat cga atg agt att caa caa cag att get get* gtt cga get gga aat 1296 
Asn Arg Met Ser He Gin Gin Gin He Ala Ala Val Arg Ala Gly Asn 
420 425 430 

agt gtt caa aac aac gga atg ctg atg cct eta gcg ggt cag cag tct 1344 
Ser Val Gin Asn Asn Gly Met Leu Met Pro Leu Ala Gly Gin Gin Ser 
435 440 445 

ttg cct egg gga cca ccg cct atg eta acc tct teg caa tea tec ate 13 92 
Leu Pro Arg Gly Pro Pro Pro Met Leu Thr Ser Ser Gin Ser Ser He 
450 455 460 

agg cag ccg atg tta tea aac cgc att tec gag aga agt ggt ttc tct 144 0 
Arg Gin Pro Met Leu Ser Asn Arg He Ser Glu Arg Ser Gly Phe Ser 
465 470 ~ 475 . 480 
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e ss s s ill ~ R « = 1 « a s ?5? is ^ 
s s e s s e ss s s K ls 52 55 i K 

500 50;> 

e s e e s e s s as e s e s e ss 
s c is e a s k s ss s a e b « 

535 540 

s e k m » S E !5 B e k s s s 

550 555 

SS E e s s S3 in S E R k e e 

565 570 



ggg ata 
Gly He 
530 

aac age 
Asn Ser 
545 

acc aga 
Thr Arg 



E E a Bp IS C S B E E SS E SS E E E 

580 585 

s s a is » is 5 ss s a ss e e e e k 

595 600 

E E iS = K SS £ E S SS S E SS E 5" 

610 615 

SS IS E E E E - E E IS E SS « 5 E B 

625 " 630 

HI E E S E S K - K £ E E SS E 85 
E E C S 58 SS K S S E S E ~ 

660 665 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2010 



<210> 32 
<211> 669 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 32 

Met Met Asn Pro Ser His Gly Arg Gly Leu Gly Ser Ala Gly Gly Ser 



10 



Ser 



Ser Gly Arg Asn Gin Gly Gly Gly Gly Glu Thr Val Val Glu Met 



20 



Phe Pro 



Ser Gly Leu Arg Val Leu Val Val Asp Asp Asp Pro Thr Cys 



35 



40 



Leu Met He Leu 
50 



Glu Arg Met Leu Arg Thr Cys Leu Tyr Glu Val Thr 

6 0 



55 



Lys 
65 



Cys Asn Arg Ala Glu Met Ala Leu Ser Leu Leu Arg Lys Asn Lys 
70 



75 
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His Gly Phe Asp He Val He Ser Asp Val His Met Pro Asp Met Asp 
85 90 95 

Gly Phe Lys Leu Leu Glu His Val Gly Leu Glu Met Asp Leu Pro Val 
100 105 HO 

He Met Met Ser Ala Asp Asp Ser Lys Ser Val Val Leu Lys Gly Val 
115 120 125 

Thr His Gly Ala Val Asp Tyr Leu He Lys Pro Val Arg Met Glu Ala 
130 135 140 

Leu Lys Asn He Trp Gin His Val Val Arg Lys Arg Arg Ser Glu Trp 
145 150 155 160 

Ser Val Pro Glu His Ser Gly Ser He Glu Glu Thr Gly Glu Arg Gin 
165 170 175 

Gin Gin Gin His Arg Gly Gly Gly Gly Gly Ala Ala Val Ser Gly Gly 
180 185 . 190 

Glu Asp Ala Val Asp Asp Asn Ser Ser Ser Val Asn Glu Gly Asn Asn 
195 200 205 

Trp Arg Ser Ser Ser Arg Lys Arg Lys Asp Glu Glu Gly Glu Glu Gin 
210 215 220 

Glv Asp Asp Lys Asp Glu Asp Ala Ser Asn Leu Lys Lys Pro Arg Val 
,225 230 235 240 

Val Trp Ser Val Glu Leu His Gin Gin Phe Val Ala Ala Val Asn Gin 
245 250 255 

Leu Gly Val Glu Lys Ala Val Pro Lys Lys He Leu Glu Leu Met Asn 
260 265 270 

Val Pro Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys Tyr 

275 280 285 / 

Arg He Tyr Leu Arg Arg Leu Gly Gly Val Ser Gin His Gin Gly Asn 
290 295 300 

Leu Asn Asn Ser Phe Met Thr Gly Gin Asp Ala Ser Phe Gly Pro Leu 
305 310 315 320 

Ser Thr Leu Asn Gly Phe Asp Leu Gin Ala Leu Ala Val Thr Gly Gin 
325 330 335 



Leu Pro Ala Gin Ser Leu Ala Gin Leu Gin Ala Ala Gly Leu Gly Arg 
340 345 350 

Pro Ala Met Val Ser Lys Ser Gly Leu Pro Val Ser Ser lie Val Asp 

355 360 365 

Glu Arg Ser lie Phe Ser Phe Asp Asn Thr Lys Thr Arg Phe Gly Glu 

370 375 380 
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Gly Leu Gly His His Gly Gin Gin Pro Gin Gin Gin Pro Gin Met Asn 

L eu Leu His Gly Val Pro Thr Gly Leu Gin Gin Gin Leu Pro Met Gly 
405 410 

Asn Arg Met Ser lie Gin Gin Gin Ue Ala Ala Val Arg Ala Gly Asn 
420 425 

Ser Val Gin Asn Asn Gly Met Leu Met Pro Leu Ala Gly Gin Gin Ser 
435 440 

Leu Pro Arg Gly Pro Pro Pro Met Leu Thr Ser Ser Gin Ser Ser He 
450 455 

Arg Gin Pro Met Leu Ser Asn Arg He Ser Glu Arg Ser Gly Phe Ser 

Gly Arg Asn Asn lie Pro Glu Ser Ser Arg Val Leu Pro Thr Ser Tyr 

Thr Asn Leu Thr Thr Gin His Ser Ser Ser Ser Met Pro Tyr Asn Asn 
500 505 

Phe Gin Pro Glu Leu Pro Val Asn Ser Phe Pro Leu Ala Ser Ala Pro 
515 520 

Gly lie Ser Val Pro Val Arg Lys Ala Thr Ser Tyr G!n Glu Glu Val 
530 535 

Asn Ser Ser Glu Ala Gly Phe lie Thr Pro Ser Tyr Asp Met Phe Thr 
545 550 

Thr Arg Gin Asn Asp Trp Asp Leu Arg Asn lie Gly He Ala Phe Asp 
565 570 

Ser His Gin Asp Ser Glu Ser Ala Ala Phe Ser Ala Ser Glu Ala Tyr 
580 585 

Ser Ser Ser Ser Met Ser Arg His Asn Thr Thr Val Ala Ala Thr Glu 
595 600 

His Gly Arg Asn His Gin Gin Pro Pro Ser Gly Met Val Gin His His 
610 615 

Gin val Tyr Ala Asp Gly Asn Gly Gly Ser Val Arg Val Lys Ser Glu 

625 630 

Arg Val Ala Thr Asp Thr Ala Thr Met Ala Phe His Glu Gin Tyr Ser 



Asn 



645 6 50 
Gin Glu Asp Leu Met Ser Ala Leu Leu Lys Gin Val 



660 



665 



<210> 33 
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<211> 1239 

<212> DNA 

<213> Arabidopsis thai i ana 
<220> 

<221> CDS 

<222> (6).- (1091) 

<223> G993 

<400> 33 

caaat atg gaa tac age tgt gta gac gac agt agt aca acg tea gaa tct 50 

Met Glu Tyr Ser Cys Val Asp Asp Ser Ser Thr Thr Ser Glu Ser 
15 10 15 



etc tec ate tct act act cca aag ccg aca acg acg acg gag aag aaa 
Leu Ser He Ser Thr Thr Pro Lys Pro Thr Thr Thr Thr Glu Lys Lys 
20 25 30 



98 



290 



338 



386 



etc tct tct ccg ccg gcg acg teg atg cgt etc tac aga atg gga age 146 
Leu Ser Ser Pro Pro Ala Thr Ser Met Arg Leu Tyr Arg Met Gly Ser 
35 40 45 

ggc gga age age gtc gtt ttg gat tea gag aac ggc gtc gag ace gag 194 
Gly Gly Ser Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Thr Glu 
50 55 60 

tea cgt aag ctt cct teg teg aaa tat aaa ggc gtt gtg cct cag cct 242 
Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro 
65 70 75 

aac gga aga tgg gga get cag att tac gag aag cat cag cga gtt tgg 
Asn Gly Arg Trp Gly Ala Gin He Tyr Glu Lys His Gin Arg Val Trp 
80 ' 85 90 95 

etc ggt act ttc aac gag gaa gaa gaa get gcg tct tct tac gac ate 
Leu Gly Thr Phe Asn Glu Glu Glu Glu Ala Ala Ser Ser Tyr Asp He 
100 105 HO 

gcc gtg agg aga ttc cgc ggc cgc gac gcc gtc act aac ttc aaa tct 
Ala Val Arg Arg Phe Arg Gly Arg Asp Ala Val Thr Asn Phe Lys Ser 
115 120 125 

caa gtt gat gga aac gac gcc gaa teg get ttt ctt gac get cat tct 434 
Gin Val Asp Gly Asn Asp Ala Glu Ser Ala Phe Leu Asp Ala His Ser 
130 135 140 

aaa get gag ate gtg gat atg ttg agg aaa cac act tac gcc gat gag 482 
Lys Ala Glu He Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu 
145 150 155 

ttt gag cag agt aga egg aag ttt gtt aac ggc gac gga aaa cgc tct 530 
Phe Glu Gin Ser Arg Arg Lys Phe Val Asn Gly Asp Gly Lys Arg Ser 
160 165 170 . 175 

ggg ttg gag acg gcg acg tac gga aac gac get gtt ttg aga gcg cgt 578 
Gly Leu Glu Thr Ala Thr Tyr Gly Asn Asp Ala Val Leu Arg Ala Arg 
180 * 185 190 

gag gtt ttg ttc gag aag act gtt acg ccg age gac gtc ggg aag ctg 626 
Glu Val Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu 
195 200 205 

aac cgt tta gtg ata ccg aaa caa cac gcg gag aag cat ttt ccg tta 674 
Asn 'Arg Leu Val He Pro Lys Gin His Ala Glu Lys HiB Phe Pro Leu 
210 215 220 

ccg gcg atg acg acg gcg atg ggg atg aat ccg tct ccg acg aaa ggc 722 
Pro Ala Met Thr Thr Ala Met Gly Met Asn Pro Ser Pro Thr Lys Gly 
225 230 235 

gtt ttg att aac ttg gaa gat aga aca ggg aaa gtg tgg. egg ttc cgt 770 
Val Leu He Asn Leu Glu Asp Arg Thr Gly Lys Val Trp Arg Phe Arg 
240 245 . 250 255 
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s s s: s ss a s ss is i a a s £ s 

260 265 

s a s e is 5! ss i s e « e g c & 

275 280 

a ss s is s a? s g a a a - s k s s 

290 295 

e si s s a e a si e is a s s w e s 

305 310 

as s a a = as k s s g e e e is | 

320 325 

«S K S! 5 S B 21 K S g S S S S f §5 

340 " 5 * 3 

tec aag aag cag gcg att ate aac ate ttg tga caaattettt ttttttggtt 
Ser Ly! Lyl Gin Ala lie He Asn lie Leu 

355 360 
tttttettea atttgtttet ecttttteaa tattttgtat tgaaatgaea agttgtaaat 
taggaeaaga eaagaaaaaa tgacaaetag aeaaaatagt ttttgtttaa aaaaaaaaaa 
aaaaaaaa 



81B 

866 

914 

962 

1010 

1058 

1111 

1171 
1231 
1239 



<210> 34 
<211> 361 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 34 

Met Glu Tyr Ser Cys Val Asp Asp Ser Ser Thr Thr Ser Glu Ser Leu 

Ser lie Ser Thr Thr Pro Lys Pro Thr Thr Thr Thr Glu Lys Lys Leu 
20 25 

Ser Ser Pro Pro Ala Thr Ser Met Arg Leu Tyr Arg Met Gly Ser Gly 
35 40 

Gly Ser Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Thr Glu Ser 
50 55 

Ar g Lys Leu Pro Ser Ser Lys Tyr Lys Gly Val Val Pro Gin Pro Asn 
65 70 

Gly Arg Trp G!y Ala Gin He Tyr Glu Lys His Gin Arg Val Trp Leu 
8 5 



Gly Thr Phe Asn Glu Glu Glu Glu Ala Ala Ser Ser Tyr Asp lie Ala 
100 105 

val Arg Arg Phe Arg Gly Arg Asp Ala Val Thr Asn Phe Lys Ser Gin 
115 120 

val Asp Gly Asn Asp Ala Glu Ser Ala Phe Leu Asp Ala His Ser Lys 
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130 135 140 

Ala Glu lie Val Asp Met Leu Arg Lys His Thr Tyr Ala Asp Glu Phe 
145 * 150 155 160 



Glu Gin Ser Arg Arg Lys Phe Val Asn Gly Asp Gly Lys Arg Ser Gly 
165 170 175 

Leu Glu Thr Ala Thr Tyr Gly Asn Asp Ala Val Leu Arg Ala Arg Glu 
180 185 190 

. Val Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val Gly Lys Leu Asn 
195 200 205 



Arg Leu Val lie Pro Lys Gin His Ala Glu Lys His Phe Pro Leu Pro 
210 215 220 

Ala Met Thr Thr Ala Met Gly Met Asn Pro Ser Pro Thr Lys Gly Val 
225 230 235 240 

Leu He Asn Leu Glu Asp Arg Thr Gly Lys Val Trp Arg Phe Arg Tyr 
245 250 255 

Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp Ser 
260 265 270 



Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val Cys Phe 
275 280 285 



Glu Arg Ser . Thr Gly Pro Asp Arg Gin Leu Tyr He His Trp Lys Val 
290 295 300 

Arg Ser Ser Pro Val Gin Thr Val Val Arg Leu Phe Gly Val Asn He 

305 310 315 320 

Phe Asn val Ser Asn Glu Lys Pro Asn Asp Val Ala Val Glu Cys Val 

325 330 335 



Gly Lys Lys Arg Ser Arg Glu Asp Asp Leu Phe Ser Leu Gly Cys Ser 
340 345 350 



Lys Lys Gin Ala He He Asn He Leu 
355 360 



<210> 


35 


<211> 


1281 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(64) . . (1098) 


<223> 


G867 



<400> 35 

cacaacacaa acacatttct gttttctcca ttgtttcaaa ccataaaaaa aaacacagat 

taa atg gaa teg agt age gtt gat gag agt act aca agt aca ggt tec 
Met Glu Ser Ser Ser Val Asp Glu Ser Thr Thr Ser Thr Gly Ser 
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1 

lit £ K S S S - - « S S « s " r 5 R 

20 

is; a a a a s a b is is e a a s a e 

s e e e a e a 3 a s B 8 5 w ~ * L " 
? !5 si a a a a a R a si w e s s - as 
;s a a o a a a? a a a a c a e e 

80 85 

k a k a s e b e a a a a a a a k 
e s a a a a o a s c k a e a a a 

115 

us a a a a a a e a e a a a a a a 

130 

a «t- aaa cqq cgt cgt aat ggt aac 

s s a e s a « a s a a a M » - 

145 15 i t 

a; s » a a s a a a a £ a s a a a 

160 

e a s a « a s a s e s a a e a a 
« s a s a e s a a a a a a a a a 

195 

s s s a a a a a a a a a a a a e 

210 21b 

is m; e a a a a is a a a ft? a o s a 

225 230 

s a a a a a a a a s a a e a s a 

240 

is ss a si a a a s s a a a a a a « 

260 

c s a a a a a % a s s a a a s e 
?4 si a a a » a a a a s a g e a a 

290 Zyb 

* tt tea ccg gag agt tea aga aac gac gtc gta 
ttg ttc gga gtt aac att tea ccg g g g^ ^ 



156 



204 



252 



300 



444 



492 



540 



684 



732 



780 



924 



972 



1020 
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Leu Phe Gly Val Asn He Ser Pro Glu Ser Ser Arg Asn Asp Val Val 
305 310 315 

gga aac aaa aga gtg aac gat act gag atg tta teg ttg gtg tgt age 1068 
Gly Asn Lys Arg Val Asn Asp Thr Glu Met Leu Ser Leu Val Cys Ser 
320 ~ 325 330 335 

aag aag caa cgc ate ttt cac gee teg taa caactcttct tctttttttt 1118 
Lys Lys Gin Arg He Phe His Ala Ser 
340 

tcttttgttg ttttaataat ttttaaaaac tccattttcg ttttctttat ttgcatcggt 1178 

ttctttcttc ttgtttacca aaggttcatg agttgttttt gttgtattga tgaactgtaa 1238 

attttattta taggataaat tttaaaaaaa aaaaaaaaaa aaa 1281 

<210> 36 

<211> 344 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 36 

Met Glu Ser Ser Ser val Asp Glu Ser Thr Thr Ser Thr Gly Ser He 
1 .5 10 15 

Cys Glu Thr Pro Ala He Thr Pro Ala Lys Lys Ser Ser Val Gly Asn 
20 25 30 

Leu Tyr Arg Met Gly Ser Gly Ser Ser Val Val Leu Asp Ser Glu Asn 
35 40 45 

Gly Val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Lys Tyr Lys Gly 
50 55 60 

Val Val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He Tyr Glu Lys 
65 70 75 80 

His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala 
85 90 95 

Arg Ala Tyr Asp Val Ala Val His Arg Phe Arg Arg Arg Asp Ala Val 
100 105 110 

Thr Asn Phe Lys Asp Val Lys Met Asp Glu Asp Glu Val Asp Phe Leu 
115 120 125 

Asn Ser His Ser Lys Ser Glu lie Val Asp Met Leu Arg Lye His Thr 
130 135 .140 

Tyr Asn Glu Glu Leu Glu Gin Ser Lys Arg Arg Arg Asn Gly Asn Gly 
145 150 155 160 

Asn Met Thr Arg Thr Leu Leu Thr Ser Gly Leu Ser Asn Asp Gly Val 
• 165 170 175 

Ser Thr Thr Gly Phe Arg Ser Ala Glu Ala Leu Phe Glu Lys Ala Val 
180 185 190 

Thr Pro Ser Asp Val Gly Lys Leu Asn Arg Leu Val He Pro Lys His 
195 200 205 
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His Ala Glu Lys His Phe Pro Leu Pro Ser Ser Asn Val Ser Val Lys 
210 215 220 

Gly Val Leu Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg Phe . 
225 230 235 240 

Aro Tyr Ser Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly 
- 1 ~ . - 250 255 



245 



Trp Ser Arg Phe Val Lys Glu Lys Asn Leu Arg Ala Gly Asp Val Val 
260 265 270 

Ser Phe Ser Arg Ser Asn Gly Gin Asp Gin Gin Leu Tyr He Gly Trp 
275 280 285 

Lys Ser Arg Ser Gly Ser Asp Leu Asp Ala Gly Arg Val Leu Arg Leu 
290 295 300 

Phe Gly val Asn lie Ser Pro Glu Ser Ser Arg Asn Asp Val Val Gly 
305 310 315 320 

Asn Lys Arg Val Asn Asp Thr Glu Met Leu Ser Leu Val Cys Ser Lys 
325 330 35b 

Lys Gin Arg He Phe His Ala Ser 
340 



<210> 


37 


<211> 


1155 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(76) . . (1077) 


<223> 


G1930 


<400> 


37 



atttcgtttc ataac atg gat gcc atg agt age gta gac gag age tct aea 



60 
111 











1 








5 








10 








act 
Thr 


aea 
Thr 


gat 
Asp 
15 


tec 
Ser 


att 
He 


ccg 
Pro 


gcg 
Ala 


aga 
Arg 
20 


aag 
Lys 


tea 
Ser 


teg tct 
Ser Ser 


ccg 
Pro 
25 


gcg 
Ala 


agt 
Ser 


tta 
Leu 


159 


eta 
Leu 


tat 
Tyr 
30 


aga 
Arg 


atg 
Met 


gga 
Gly 


age 
Ser 


gga 
Gly 
35 


aea 
Thr 


age 
Ser 


gtg 

Val 


gta ctt 
Val Leu 
40 


gat 
Asp 


tea 
Ser 


gag 
Glu 


aac 
Asn 


207 


ggt 
Gly 
45 


gtc 
val 


gaa 
Glu 


gtc 
Val 


gaa 
Glu 


gtc 
Val 
50 


gaa 
Glu 


gcc 
Ala 


gaa 
Glu 


tea 
Ser 


aga aag 
Arg Lys 
55 


ctt 
Leu 


cct 
Pro 


tct 
Ser 


tea 
Ser 
60 


255 


aga 
Arg 


ttc 
Phe 


aaa 
Lys 


ggt 
Gly 


gtt 
Val 
65 


gtt 
Val 


cct 
Pro 


caa 
Gin 


cca 
Pro 


aat 
Asn 
70 


gga aga 
Gly Arg 


tgg 
Trp 


gga 
Gly 


get 
Ala 
75 


cag 
Gin 


303 


att 
He 


tac 
Tyr 


gag 
Glu 


aaa 
Lys 
80 


cat 
His 


caa 
Gin 


cgc 
Arg 


gtg 
Val 


tgg 
Trp 

85 


ctt 
Leu 


ggt act 
Gly Thr 


ttc 
Phe 


aac 
Asn 
90 


gag 
Glu 


gaa 
Glu 


351 
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gac gaa 
Asp Glu 


gca 
Ala 
95 


get 
Ala 


cgt get tac 
Arg Ala Tyr 


gac 
Asp 
100 


gtc 
Val 


gcg 
Ala 


get 
Ala 


cac 
His 


cgt ttc 
Arg Phe 
105 


cgt ggc 
Arg Gly 


399 


cgc gat 
Arg Asp 
110 


gec 
Ala 


gtt 
Val 


act aat ttc 
Thr Asn Phe 
115 


aaa 
Lys 


gac 
Asp 


acg 
Thr 


acg 
Thr 


ttc 
Phe 
120 


gaa gaa 
Glu Glu 


gag gtt 
Glu Val 


447 


gag ttc 
Glu Phe 
125 


tta 
Leu 


aac 
Asn 


gcg cat teg 
Ala His Ser 
130 


aaa 
Lys 


tea 
Ser 


gag 
Glu 


ate 
He 
135 


gta 
val 


gat atg 
Asp Met 


ttg aga 
Leu Arg 
140 


495 


aaa cac 
Lys His 


act 
Thr 


tac 
Tyr 


aaa gaa gag 
Lys Glu Glu 
145 


tta 
Leu 


gac 
Asp 


caa 
Gin 
150 


agg 
Arg 


aaa 
Lys 


cgt aac 
Arg Asn 


cgt gac 
Arg Asp 
155 


543 


ggt aac 
Gly Asn 


gga 
Gly 


aaa 
Lys 
160 


gag acg acg 
Glu Thr Thr 


gcg 
Ala 


ttt 
Phe 
165 


get 
Ala 


ttg 
Leu 


get 
Ala 


teg atg 
Ser Met 
170 


gtg gtt 
Val Val 


591 


atg acg 
Met Thr 


ggg 

Gly 
175 


ttt 
Phe 


aaa acg gcg 
Lys Thr Ala 


gag 
Glu 
180 


tta 
Leu 


ctg 
Leu 


ttt 
Phe 


gag 
Glu 


aaa acg 
Lys Thr 
185 


gta acg 
Val Thr 


639 


cca agt 
Pro Ser 
190 


gac 
Asp 


gtc 
Val 


ggg aaa eta 
Gly Lys Leu 
195 


aac 
Asn 


cgt 
Arg 


tta 
Leu 


gtt 
Val 


ata 
He 
200 


cca aaa 
Pro Lys 


cac caa 
His Gin 


687 


gcg gag 
Ala Glu 
205 


aaa 
Lys 


cat 
His 


ttt ccg tta 
Phe Pro Leu 
210 


ccg 
Pro 


tta 
Leu 


ggt 
Gly 


aat 
Asn 
215 


aat 
Asn 


aac gtc 
Asn Val 


tec gtt 
Ser Val 
220 


735 


aaa ggt 
Lys Gly 


atg 
Met 


ctg 
Leu 


ttg aat ttc 
Leu Asn Phe 
225 


gaa 
Glu 


gac 
Asp 


gtt 
Val 
230 


aac 
Asn 


ggg 

Gly 


aaa gtg 
Lys Val 


tgg agg 
Trp Arg 
235 


783 


ttc cgt 
Phe Arg 


tac 
Tyr 


tct 
Ser 
240 


tat tgg aat 
Tyr Trp Asn 


agt 
Ser 


agt 
Ser 
245 


caa 
Gin 


agt 
Ser 


tat 
Tyr 


gtg ttg 
Val Leu 
250 


ace aaa 
Thr Lys 


831 


ggt tgg 
Gly Trp 


agt 
Ser 
255 


aga 
Arg 


ttc gtt aaa 
Phe Val Lys 


gag 
Glu 
260 


aag 
Lys 


aga 
Arg 


ctt 
Leu 


tgt 
Cys 


get ggt 
Ala Gly 
265 


gat ttg 
Asp Leu 


879 


ate agt 
He Ser 
270 


ttt 
Phe 


aaa 
Lys 


aga tec aac 
Arg Ser Asn 
275 


gat 
Asp 


caa 
Gin 


gat 
Asp 


caa 
Gin 


aaa 
Lys 
280 


ttc ttt 
Phe Phe 


ate ggg 
He Gly 


927 


tgg aaa 
Trp Lys 
285 


teg 
Ser 


aaa 
Lys 


tec ggg ttg 
Ser Gly Leu 
290. 


gat 
Asp 


eta 
Leu 


gag 

Glu 


acg 
Thr 
295 


ggt 
Gly 


egg gtt 
Arg Val 


atg aga 
Met Arg 
300 


975 


ttg ttt 
Leu Phe 


ggg 

Gly 


gtt 
Val 


gat att tct 
Asp He Ser 
305 


tta 
Leu 


aac 
Asn 


gec 
Ala 
310 


gtc 
Val 


gtt 
val 


gta gtg 
Val Val 


aag gaa 
Lys Glu 
315 


1023 


aca acg 
Thr Thr 


gag 
Glu 


gtg 
Val 
320 


tta atg teg 
Leu Met Ser 


teg 
Ser 


tta 
Leu 
325 


agg 
Arg 


tgt 
Cys 


aag 
Lys 


aag caa 
Lys Gin 
330 


cga gtt 
Arg Val 


1071 



ttg taa taacaattta acaacttggg aaagaaaaaa aagctttttg attttaattt 1127 
Leu 

ctcttcaacg ttaatcttgc tgagatta H55 

<210> 38 

<211> 333 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 38 
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MBI0018 Sequence Listin 9- s J >25 ft ^ CoY _ 
Met Asp Ala Met Ser Ser Val Asp Glu Ser Ser Thr Thr Thr Asp Ser 



XI. Pro Ala Arg Lys Ser Ser Ser Pro Ala Ser Leu Leu Tyr Arg Met 
20 25 

Gly Ser Gly Thr Ser Val Val Leu Asp Ser Glu Asn Gly Val Glu Val 
35 40 

Glu val Glu Ala Glu Ser Arg Lys Leu Pro Ser Ser Arg Phe Lys Gly 



50 



55 



val val Pro Gin Pro Asn Gly Arg Trp Gly Ala Gin He Tyr Glu Lys 



65 



70 



His Gin Arg Val Trp Leu Gly Thr Phe Asn Glu Glu Asp Glu Ala Ala 
85 y " 

Arg Ala Tyr Asp val Ala Ala His Arg Phe Arg Gly Arg Asp Ala Val 
100 105 

Thr Asn Phe Lys Asp Thr Thr Phe Glu Glu Glu Val Glu Phe Leu Asn 
115 120 

Ala His Ser Lys Ser Glu lie Val Asp Met Leu Arg Lys His Thr Tyr 



130 



135 



Ly3 Glu Glu Leu Asp Gin Arg Lys Arg Asn Arg Asp Gly Asn Gly Lys 
145 150 

G lu Thr Thr Ala Phe Ala Leu Ala Ser Met Val Val Met Thr Gly Phe 
165 170 

Lys Thr Ala Glu Leu Leu Phe Glu Lys Thr Val Thr Pro Ser Asp Val 
180 185 



_ 7\t*ri t *u val He Pro Lys His Gin Ala Glu Lys His 
Gly Lys Leu Asn Arg Leu val lie rro uy ^ 



195 



Phe Pro Leu Pro Leu Gly Asn Asn Asn Val Ser Val Lys Gly Met Leu 
210 215 

Leu Asn Phe Glu Asp Val Asn Gly Lys Val Trp Arg Phe Arg Tyr Ser 
225 230 

Tyr Trp Asn Ser Ser Gin Ser Tyr Val Leu Thr Lys Gly Trp Ser Arg 

Phe val Lys Glu Lys Arg Leu Cys Ala Gly Asp Leu lie Ser Phe Lys 
260 265 

Arg Ser Asn Asp Gin Asp Gin Lys Phe Phe He Gly Trp Lys Ser Lys 
275 280 

Ser Gly Leu Asp Leu Glu Thr Gly Arg Val Met Arg Leu Phe Gly Val 



290 295 
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Asp lie Ser Leu Asn Ala Val Val Val Val Lys Glu Thr Thr Glu Val 
305 310 315 320 

Leu Met Ser Ser Leu Arg Cys Lys Lys Gin Arg Val Leu 
325 330 



<210> 39 

<211> 1035 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1)..(1035) 

<223> G1594 



<400> 39 

atg tac aat ttc cat teg gec ggt gat tat tea gat aag teg gtt ctg 
Met. Tyr Asn Phe His Ser Ala Gly Asp Tyr Ser Asp Lys Ser Val Leu 
15 10 15 



48 



atg atg tea ccg gag agt etc atg ttt cct tec gat tac caa get ttg 96 
Met Met Ser Pro Glu Ser Leu Met Phe Pro Ser Asp Tyr Gin Ala Leu 
20 25 30 

eta tgt tec tec gee ggt gaa aat cgt gtc tct gat gtt ttc gga tec 144 
Leu Cys Ser Ser Ala Gly Glu Asn Arg Val Ser Asp Val Phe Gly Ser 
35 40 45 

gac gag eta etc tea gta gec gtc tec get ttg teg teg gag gcg get 192 
Asp Glu Leu Leu Ser Val Ala Val Ser Ala Leu Ser Ser Glu Ala Ala 
50 . 55 60 

teg ate get ccg gag ate cga aga aat gat gat aac gtt tct eta act 240 
Ser lie Ala Pro Glu He Arg Arg Asn Asp Asp Asn Val Ser Leu Thr 
65 70 75 80 

gtc ate aaa get aaa ate get tgt cat cct teg tat cct cgc tta ctt 288 
Val lie Lys Ala Lys He Ala Cys His Pro Ser Tyr Pro Arg Leu Leu 
85 90 95 

caa get tac ate gat tgc caa aag gtc gga gca cca ccg gag ata gcg 336 
Gin Ala Tyr He Asp Cys Gin Lys Val Gly Ala Pro Pro Glu He Ala 
100 105 110 

tgt tta eta gag gag att caa egg gag agt gat gtt tat aag caa gag 384 
Cys Leu Leu Glu Glu He Gin Arg Glu Ser Asp Val Tyr Lys Gin Glu 
115 120 125 

gtt gtt cct tct tct tgc ttt gga get gat cct gag ctt gat gaa ttt 432 
Val Val Pro Ser Ser Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe 
130 135 140 

atg gaa acg tac tgc gat ata tta gtg aaa tac aaa teg gat eta gca 480 
Met Glu Thr Tyr Cys Asp He Leu Val Lys Tyr Lys Ser Asp Leu Ala 
145 150 155 160 

aga ccg ttt gac gag gca acg tgt ttc ttg aac aag att gag atg cag 528 
Arg Pro Phe Asp Glu Ala Thr Cys Phe Leu Asn Lys He Glu Met Gin 
165 170 175 

eta egg aac eta tgt act ggt gtc gag tct gee agg gga gtt tct ggg 576 
Leu Arg Asn Leu Cys Thr Gly Val Glu Ser Ala Arg Gly Val Ser Gly 
180 185 190 

ggg atg tct cct cat ggg gac aag act att agt cct etc ctg aca aat 624 
Gly Met Ser Pro His Gly Asp Lys Thr He Ser Pro Leu Leu Thr Asn 
195 200 205 

gac aat gga gag gat ggt gta ata tea tct gac gag gaa ctg agt gga 672 
Asp Asn Gly Glu Asp Gly Val He Ser Ser Asp Glu Glu Leu Ser Gly 
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210 



215 220 



aat aat cat qaq ata gca gag gat ggg aga caa aga tgt gaa gac egg 
1?J Asp lis G?u Val Ala Glu Asp Sly Arg Gin Arg Cys Glu Asp Arg 
225 230 235 *™ 



aac etc aaa aat aqg ttg eta cgc aaa ttt gga age cgt att agt act 
^sp Leu Lys HI ^9 LeS Leu Arg Lys Phe Gly Ser Arg lie Ser Thr 
245 250 

tta aaa ctt qaq ttc tea aag aag aag aag aaa gga aag tta cca aga 
Leu Lys Leu Glu Phe Ser Lyl Lyl Lyl Lys Lys Gly Lys Leu Pro Arg 
260 265 270 

gaa gca aga caa get ctt ctt gat tgg tgg aat etc cat tat aag tgg 
Glu Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Leu His Tyr Lys Trp 
275 280 285 

cct tac cct act gaa gga gat aag ata gca tta get gat gca acg ggg 
Pro Tyr Pro Thr Glu Gly Asp Lys lie Ala Leu Ala Asp Ala Thr Gly 

- 295 300 



tta gac caa aaa caa ate aac aat tgg ttt ata aac caa agg aaa cgt 
Leu Asp Gin Lys Gin lie Asn Asn Trp Phe lie Asn Gin Arg Lys Arg 



305 310 315 

t-aa aaa cca tea qaq aat atg cct ttc get atg atg gat gat tct 

Hit ftp Lyl Pro Ser 18 Asn Met Pro Phe Ala Met Met Asp Asp Ser 

325 -*30 no 



agt gga tea ttc ttt ace gag gaa tga 
Ser Gly Ser Phe Phe Thr Glu Glu 
340 



<210> 40 

<211> 344 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 40 

Met Tyr Asn Phe His Ser Ala Gly Asp Tyr Ser Asp Lys Ser Val Leu 
1 5 10 « 

Met Met Ser Pro Glu Ser Leu Met Phe Pro Ser Asp Tyr Gin Ala Leu 
20 25 30 

Leu Cys Ser Ser Ala Gly Glu Asn Arg Val Ser Asp Val Phe Gly Ser 
35 40 45 

Asp Glu Leu Leu Ser Val Ala Val Ser Ala Leu Ser Ser Glu Ala Ala 
50 55 60 

Ser lie Ala Pro Glu lie Arg Arg Asn Asp Asp Asn Val Ser Leu Thr 
65 70 75 

Val lie Lys Ala Lys He Ala Cys His Pro Ser Tyr Pro Arg Leu Leu 

85 9° 95 

Gin Ala Tyr lie Asp Cys Gin Lys Val Gly Ala Pro Pro Glu lie Ala 
100 105 

Cys Leu Leu Glu Glu lie Gin Arg Glu Ser Asp Val Tyr Lys Gin Glu 
7 115 120 125 



PCT/US00/31325 



720 



768 



816 



864 



912 



960 



1008 



1035 



Val Val Pro Ser Ser 



Cys Phe Gly Ala Asp Pro Glu Leu Asp Glu Phe 
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130 135 140 

Met Glu Thr Tyr Cys Asp He Leu Val Lys Tyr Lys Ser Asp Leu Ala 
145 150 155 160 

Arg Pro Phe Asp Glu Ala Thr Cys Phe Leu Asn Lys lie Glu Met Gin 
165 170 175 

Leu Arg Asn Leu Cys Thr Gly Val Glu Ser Ala Arg Gly Val Ser Gly 
180 185 190 

Gly Met Ser Pro His Gly Asp Lys Thr He Ser Pro Leu Leu Thr Asn 
195 200 205 

Asp Asn Gly Glu Asp Gly Val He Ser Ser Asp Glu Glu Leu Ser Gly 
210 215 220 

Gly Asp His Glu Val Ala Glu Asp Gly Arg Gin Arg Cys Glu Asp Arg 
225 230 235 240 

Asp Leu Lys Asp Arg Leu Leu Arg Lys Phe Gly Ser Arg lie Ser Thr 
245 250 255 

Leu Lys Leu Glu Phe Ser Lys Lys Lys Lys Lys Gly Lys Leu Pro Arg 
260 265 270 

Glu Ala Arg Gin Ala Leu Leu Asp Trp Trp Asn Leu His Tyr Lys Trp 
275 280 285. 

Pro Tyr Pro Thr Glu Gly Asp Lys He Ala Leu Ala Asp Ala Thr Gly 
290 295 300 

Leu Asp Gin Lys Gin He Asn Asn Trp Phe He Asn Gin Arg Lys Arg 
305 * 310 315 320 

His Trp Lys Pro Ser Glu Asn Met Pro Phe Ala Met Met Asp Asp Ser 
325 330 335 

Ser Gly Ser Phe Phe Thr Glu Glu 
340 

<210> 41 
<211> 2559 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1)..(2259) 

<223> G391 

<400> 41 

atg atg atg gtc cat teg atg age aga gat atg atg aac aga gag teg 48 
Met Met Met Val His Ser Met Ser Arg Asp Met Met Asn Arg Glu Ser 
15 10 15 



ccg gat aaa ggg tta gat tec ggc aag tat gtg agg tac acg ccg gag 
Pro Asp Lys Gly Leu Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu 
20 25 30 



96 
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ss ss? is e £ e s e ss £ e 5 fr a ~ s 
s = a s s s s s ss e §5 a ss ss a ss 

50 " 55 60 



;;s IS SS 52 SS SS SSS SSS £ f ss SS5S 
SS SS SS B S IS E B 21 2S S S O S 5* 

ss s is a ss s ss ss s e ss ss e § a a 

100 105 

» a sss a ss a c ss is s?s ss si ss ss ss 

115 120 AZD 

SS SS SS SS SS 8 S? SS S E5 S S 35 SS SS SS? 



qtc gtg age ggt cag caa cat caa cag caa aac cca aat cct cag cat 
&1 Val sir Gly Gin Gin His Gin Gin Gin Asn Pro Asn Pro Gin His 
150 155 



145 



SS SS 3S ss ss ss ss ss ss §?; ss ss ss s; ss ss 
ss ss ss ss ss SS SS SS SSS ss ss s?s ss ss sss ss 

180 185 

SSI SSS SS S? SS S a S SS 115 SS B SS SS S?S SS 

195 20° 

ss? SS S SSI 35 SS S?S SIS S?S ss SS SS IS ss S?S S?S 

210 215 2 20 

SS SS? SS SS SS SS s ss ss; SS IS SS SS SS IS 2} 

225 230 

SS SSS S? SS SS SS S?S 3? SSS SS? SS SS SS SIS SS? SS 

SS SS !?? ss s?s a? B ss SS ss ss ss s? ss s? SSS 



ss SS S? SS SS SS SS SS 31 ss SS SSS s ss s Sf 

275 280 

ss ss sss ss ss ss m ss ss ss? ss; sss ss 3 s? ss 

290 295 300 

ss SSS ss ss s?s s?s ss ss vs ss ss ss ss ss ss ss? 

305 310 315 

SI SS SS SS 5 SS SSS S?S SS SS SS 3S SS S?S SS S?S 

3 325 330 
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144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 
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ggt ggt tec att etc cac att gtt gat cat gtt gat ctg gat gec tgg 1056 

Gly Gly Ser lie Leu His He Val Asp His Val Asp Leu Asp Ala Trp 

340 345 350 

agt gtc cct gaa gtc atg agg cct etc tat gaa tea teg aag att ctt 1104 

Ser Val Pro Glu Val Met Arg Pro Leu Tyr Glu Ser Ser Lys He Leu 
355 360 365 

get cag aaa atg act gtt get get ttg aga cat gta aga caa att gca 1152 

Ala Gin Lys Met Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala 

370 375 380 

caa gaa aca agt gga gaa gtt cag tat ggt gga ggg cgc caa cct gcg 1200 

Gin Glu Thr Ser Gly Glu Val Gin Tyr Gly Gly Gly Arg Gin Pro Ala 

385 390 395 400 

gtt tta aga ace ttc agt caa aga etc tgt egg ggt ttc aat gat get 1248 

Val Leu Arg Thr Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala 
405 410 415 

gtt aat ggt ttt gtg gat gat gga tgg tea cca atg ggt age gat ggt 1296 

Val Asn Gly Phe Val Asp Asp Gly Trp Ser Pro Met Gly Ser Asp Gly 

420 425 430 

gca gag gat gtt act gta atg ata aac ttg tec cct ggg aag ttt ggt 1344 

Ala Glu Asp Val Thr Val Met He Asn Leu Ser Pro Gly Lys Phe Gly 
435 440 445 

ggg tct cag tac ggt aat tea ttc ctt cca age ttt ggt agt ggc gtg 1392 

Gly Ser Gin Tyr Gly Asn Ser Phe Leu Pro Ser Phe Gly Ser Gly Val 

450 ■ 455 460 

ctt tgt gee aag gca tct atg ttg ctt cag aac gtt cca ccc get gtg 1440 

Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 

465 , 470 475 480 

ctg gtt cga ttc ctt aga gaa cac cga tct gaa tgg get gat tat ggc 1488 

Leu Val Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Tyr Gly 
485 490 495 

gtg gat get tat get get gca teg etc aga gca agt cct ttt get gtt 1536 

Val Asp Ala Tyr Ala Ala Ala Ser Leu Arg Ala Ser Pro Phe Ala Val 

500 505 510 ' 

cct tgt get aga get ggg ggg ttc cca agt aac caa gtc att ctt cct 1584 

Pro Cys Ala Arg Ala Gly Gly Phe Pro Ser Asn Gin Val He Leu Pro 
515 520 525 

ctt gcg cag aca gtt gaa cat gaa gag tea ctt gag gtg gtt aga ctt 1632 

Leu Ala Gin Thr Val Glu His Glu Glu Ser Leu Glu Val Val Arg Leu 

530 535 540 

gaa ggt cac get tac tea ccc gaa gac atg ggt tta get egg gat atg 1680 

Glu Gly His Ala Tyr Ser Pro Glu Asp Met Gly Leu Ala Arg Asp Met 

545 550 555 560 

tat ttg eta cag ctt tgt age ggt gtt gat gaa aat gtg gtt gga ggt 1728 

Tyr Leu Leu Gin Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly 
565 570 575 

tgt gca cag ctt gta ttt gee cct ate gat gaa tea ttt get gat gat 1776 

Cys Ala Gin Leu Val Phe Ala Pro He Asp Glu Ser Phe Ala Asp Asp 

580 585 590 



gca cct ttg ctt cct tec ggt ttc cgc ate ata cct ctt gaa cag aaa 1824 
Ala Pro Leu Leu Pro Ser Gly Phe Arg He lie Pro Leu Glu Gin Lys 
595 600 605 

tct act ccg aac ggt gca tct gca aac cgt acc ctg gat tta gee tea 1872 
Ser Thr Pro Asn Gly Ala Ser Ala Asn Arg Thr Leu Abp Leu Ala Ser 
610 615 620 

get tta gaa gga tec aca cgt caa get ggt gaa gee gac cca aat ggc 1920 
Ala Leu Glu Gly Ser Thr Arg Gin Ala Gly Glu Ala Asp Pro Asn Gly 
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625 



630 



MBI0018 Sequence Listing. ST25 
635 



640 



cct ggc 
Pro Gly 
690 

etc act 
Leu Thr 
705 

gca gat 
Ala Asp 



s ss s s s e s s ;s e e a £ a e e 
xs s s a b s c s s a s E a E 32 s 

660 665 

s s e «r, s s s si e e a s e e e s 

675 680 

IS? E S E SS S S 32 SI E E E B K 

695 700 

iSBSSSSSSSgSSKSS 

710 

s s i?; s E s ss s a « c s a a 

725 730 

s ss s si e s s e e s a e e e e e 

740 745 
aca aacgcttcac eggttttcac attegcaaac caaaceggtt tagacatget 
Thr 

ggaaacgact cttgtagccc ttcaagacat aatgetagae aagacccttg acgaacctgg 
tegtaaaget etttgetctg agttccccaa gatcatgeaa cagggctatg ctcatctgcc 
ggcaggagta tgtgcgtcaa gcatgggaag gatggtatct tacgagcagg caacggtgtg 
gaaagttctt gaagacgatg aatcaaacca ctgettaget ttcatgttcg tgaattggtc 
gttcgtttga 

<210> 42 

<211> 753 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 42 

Met Met Met Val His Ser Met Ser Arg Asp Met Met Asn Arg Glu Ser 
1 5 

Lys Gly Leu Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu 



Pro Asp 



Gin 



Ser 



20 25 
Val Glu Ala Leu Glu Arg Val Tyr Thr Glu Cys Pro Lys Pro Ser 



35 40 
Leu Arg Arg Gin Gin Leu He Arg Glu Cys Pro He Leu Ser Asn 



50 55 
lie Glu Pro Lys Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg 



65 



70 



Glu Lys Gin Arg Lys Glu Ala Ala Arg Leu Gin Thr Val Asn Arg Lys 

8 5 *"' 



1968 

2016 

2064 

2112 

2160 

2208 

2256 

2309 

2369 
2429 
2489 
2549 
2559 
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Leu Asn Ala Met Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin 
100 105 110 

Lys Glri Val Ser Asn Leu Val Tyr Glu Asn Gly His Met Lys His Gin 
115 120 125 

Leu His Thr Ala Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val 
130 135 140 

Val Val Ser Gly Gin Gin His Gin Gin Gin Asn Pro Asn Pro Gin His 
145 150 155 160 

Gin Gin Arg Asp Ala Asn Asn Pro Ala Gly Leu Leu Ser He Ala Glu 
165 170 175 

Glu Ala Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr Ala Val Asp 
180 185 190 

Trp val Gin Met He Gly Met Lys Pro Gly Pro; Asp Ser He Gly He 
195 200 205 

Val Ala lie Ser Arg Asn Cys Ser Gly lie Ala Ala Arg Ala Cys Gly 
210 215 220 

Leu Val Ser Leu Glu Pro Met Lys Val Ala Glu He Leu Lys Asp Arg 
225 230 235 240 



Pro Ser Trp Leu Arg Asp Cys Arg Ser Val Asp Thr Leu Ser Val He 
245 250 255 

Pro Ala Gly Asn Gly Gly Thr He Glu Leu He Tyr Thr Gin Met Tyr 
260 265 270 

Ala Pro Thr Thr Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr 
275 280 285 

Ser Thr Cys Leu Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu 
290 295 300 

Thr Ser Ala Thr Gly Gly Pro Thr Gly Pro Pro Ser Ser Asn Phe Val 
305 310 315 320 

Arg Ala Glu Met Lys Pro Ser Gly Phe Leu lie Arg Pro Cys Asp Gly 
325 330 335 

Gly Gly Ser He Leu His He Val Asp His Val Asp Leu Asp Ala Trp 
340 345 350 

Ser Val Pro Glu Val Met Arg Pro Leu Tyr Glu Ser Ser Lys He Leu 
355 360 365 

Ala Gin Lys Met Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala 
370 375 380 



Gin Glu Thr Ser Gly Glu Val Gin Tyr Gly Gly Gly Arg Gin Pro Ala 
385 390 395 400 
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Val Leu Arg Thr Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala 
405 410 

val Asn Gly Phe Val Asp Asp Gly Trp Ser Pro Met Gly Ser Asp Gly 
420 425 

Ala Glu Asp Val Thr Val Met lie Asn Leu Ser Pro Gly Lys Phe Gly 
435 440 4,5 



Gly ser Gin Tyr Gly Asn Ser Phe Leu Pro Ser Phe Gly Ser Gly Val 
450 455 

Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn Val Pro Pro Ala Val 
465 470 

Leu Val Arg Phe Leu Arg Glu His Arg Ser Glu Trp Ala Asp Tyr Gly 
435 490 



Val Asp Ala Tyr Ala Ala Ala Ser Leu Arg Ala Ser Pro Phe Ala Val 



500 



Cys Ala Arg Ala Gly Gly Phe Pro Ser Asn Gin Val He Leu Pro 
515 520 5 



Leu Ala Gin Thr Val Glu His Glu Glu Ser Leu Glu Val Val Arg Leu 
530 535 540 

Glu Gly His Ala Tyr Ser Pro Glu Asp Met Gly Leu Ala Arg Asp Met 
545 550 5&s 

Tyr Leu Leu Gin Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly 



565 



Cys Ala Gin Leu Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp 



580 



Ala Pro Leu Leu Pro Ser Gly Phe Arg He lie Pro Leu Glu Gin Lys 
595 600 f" 3 



Ser Thr Pro Asn Gly Ala Ser Ala Asn Arg Thr Leu Asp Leu Ala Ser 
Ala Leu Glu Gly Ser Thr Arg Gin Ala Gly Glu Ala Asp Pro Asn Gly 



625 



630 



Cys Asn Phe Arg Ser Val Leu Thr lie Ala Phe Gin Phe Thr Phe Asp 
645 650 



Asn His ser Arg Asp Ser Val Ala Ser Met Ala Arg Gin Tyr Val Arg 
660 665 

Ser He Val Gly Ser lie Gin Arg Val Ala Leu Ala lie Ala Pro Arg 

680 



675 



Pro Gly Ser Asn lie Ser Pro lie Ser Val Pro Thr Ser Pro Glu Ala 
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690 695 700 

Leu Thr Leu Val Arg Trp He Ser Arg Ser Tyr Ser Leu His Thr Gly 
705 710 715 720 

Ala Asp Leu Phe Gly Ser Asp Ser Gin Thr Ser Gly Asp Thr Leu Leu 
725 730 735 

His Gin Leu Trp Asn His Ser Asp Ala He Leu Cys Cys Ser Leu Lys 
740 745 750 



Thr 



<210> 43 

<211> 2526 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1)..(2526) 

<223> G390 



<400> 43 



48 



96 



atg atg get cat cac tec atg gac gat aga gac tct cct gat aaa gga 

Met Met Ala His His Ser Met Asp Asp Arg Asp Ser Pro Asp Lys Gly 
1 5 10 15 

ttt gat tec ggc aag tac gtt aga tac acg ccg gaa caa gtt gaa get 

Phe Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala 
20 25 30 

ctt gag aga gtt tat get gag tgt cct aaa cct age tct ctg aga aga 144 
Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser Ser Leu Arg Arg 
35 40 45 

caa cag ctt att cgt gaa tgt ccc att etc tgt aac ate gag cct cga 192 

Gin Gin Leu He Arg Glu Cys Pro He Leu Cys Asn He Glu Pro Arg 

50 55 60 

cag ate aaa gtt tgg ttc cag aat cgc aga tgt cga gag aag cag agg 
Gin He Lys Val Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg 
65 70 75 80 



240 



288 



336 



384 



aaa gag tea get cgt ctt cag aca gtg aac agg aag ctg agt get atg 
Lys Glu Ser Ala Arg Leu Gin Thr Val Asn Arg Lys Leu Ser Ala Met 
85 90 95 

aac aag ctt ttg atg gaa gag aat gat cgt ttg cag aag caa gtc tec 
Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser 
100 105 HO 

aac ttg gtt tat gag aat gga ttc atg aaa cat cga ate cac act get 
Asn Leu Val Tyr Glu Asn Gly Phe Met Lys His Arg He His Thr Ala 
115 120 125 

tct ggg acg ace aca gac aac age tgt gag tct gtg gtc gtg agt ggt 432 
Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val Val Val Ser Gly 
130 135 140 

cag caa cgt cag cag caa aac cca aca cat cag cat cct cag cgt gat 
Gin Gin Arg Gin Gin Gin Asn Pro Thr His Gin His Pro Gin Arg. Asp 
145 150 155 160 

gtt aac aac cca get aat ctt etc teg att gcg gag gag acc ttg gcg 
Val Asn Asn Pro Ala Asn Leu Leu Ser He Ala Glu Glu Thr Leu Ala 
165 170 175 



480 



528 
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gag etc ctt tgc aag get aca gga act get gtc gac tgg gtc cag atg 
Glu Phe Leu (*s Lys Ala Thr Gly Thr Ala Val Asp Trp Val Gin Met 
180 185 190 

att qqq atg aag cct ggt ccg gat tct att ggt ate gta get gtt tea 
ill Gly Met Lyl Pro Gly Pro Asp Ser lie Gly He Val Ala Val Ser 
!95 200 205 

cac aac tqc aqt gga ata gca gca cgt gee tgt ggc etc gtg agt tta 
Arg Asn £s Ser G?y He Ala Ala Arg Ala Cys Gly Leu Val Ser Leu 



aaa ccc atq aaq gtc get gaa ate etc aaa gat cgt cca tct tgg ttc 
l!u Pro Me? Lyl Val Ala Glu lie Leu Lys Asp. Arg Pro Ser Trp Phe 
225 230 235 *™ 



cot qac tqt cga tgt gtc gag act ctg aat gtt ata ccc act gga aat 
K Asp £s j*g Cys Val Glu Thr Leu Asn Val lie Pro Thr Gly Asn 

aat aat act ate qag ctt gtc aac act cag att tat get cct aca aca 
Ig % ?hr lie Glu Leu val Asn Thr Gin lie Tyr Ala Pro Thr Thr 



tta gca gca get cgt gac ttt tgg acg ctg aga tat agt aca agt eta 
Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr Ser Thr Ser Leu 
275 280 

hai- aaa aac tat atq qtc tgt gag aga tea etc act tct gca act 
l!u lt P E!y Hr £r III Val Cys III A?g Ser Leu Thr Ser Ala Thr 
290 ' 295 300 

qqt qqc ccc aat ggt cca ctt tct tea age ttc gtg aga gec aaa atg 

Ily Gly Pro Asn Gly Pro Leu Ser Ser Ser Phe Val Arg Ala Lys Met 

305 310 315 

eta tea aac qqq ttt ctt ate cgt cct tgt gat ggt ggt ggt tec att 

Leu Ser sir Gly Phe Leu lie Arg Pro Cys Asp Gly Gly Gly Ser lie 



att cac ate qtt gat cat gtg gac ttg gat gtc tea agt gtt cct gaa 
Ue Tit At Val Asp His Val Lp Leu Asp Val Ser Ser Val Pro Glu 
340, 345 350 

ate etc agg cct ctt tat gag tct tec aaa ate ctt get caa aaa atg 
Vat let tel Pro Leu Tyr Glu Ser Ser Lys lie Leu Ala Gin Lys Met 
355 360 36b 

act ate qct qct ctg aga cat gtg cgc caa att get caa gag act agt 
?Sr Val Ala Ala Leu Arg His Val Arg Gin lie Ala Gin Glu Thr Ser 
370 375 380 

aaa aaa ate caq tat agt ggt gga cgc cag cct gca gtt tta agg act 
l!y llu Val III Ser Ely Sly Arg Gin Pro Ala Val Leu Arg Thr 
385 390 395 ™u 

an,, i-arr aaa etc tac cqq qqt ttc aat gat get gta aat ggt ttt 
Phe sir Gin A?g Leu §S fg oly Phe Asn Lp Ala Val Asn Gly Phe 
405 410 

gtc gat gat gga tgg tct cca atg agt agt gat gga gga gag gat att 
Val Asp Asp Gly Tr? Ser Pro Met Ser Ser Asp Gly Gly Glu Asp lie 



385 " 390 395 

etc tgc 
Leu Cys 
405 

agt agt 
Ser Ser 

420 ~ 425 430 

acg ate atg att aac tct tec tct get aaa ttt get ggc tec caa tac 
Th? lie Mel lie Asn Ser Ser Ser Ala Lys Phe Ala Gly Ser Gin Tyr 
435 440 445 

cat aac tea ttt ctt cca agt ttt gga agt ggt gtc etc tgt gee aaa 
l!y ler Ser Phe ^u Pro sir Phe Gly Ser Gly Val Leu Cys Ala Lys 
450 455 460 



act tct atq ctg ttg cag aat gtt cca ccc ctt gta ttg att egg ttc 
III Ser Me? Leu Leu Oil Asn Val Pro Pro Leu Val Leu He Arg Phe 
465 470 475 *80 
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624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 
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ctg aga gaa cac cga get gaa tgg gca gac tat ggt gtc gat gec tat 1488 

Leu Arg Glu His Arg Ala Glu Trp Ala Asp Tyr Gly Val Asp Ala Tyr 
485 490 495 

tct get gca tct etc aga gca act cca tat get gtt cca tgc gtc aga 1536 

Ser Ala Ala Ser Leu Arg Ala Thr Pro Tyr Ala Val Pro Cys Val Arg 
500 505 510 



acc ggt ggg ttc ccg agt aac caa gtc att ctt cct etc gca cag aca 
Thr Gly Gly Phe Pro Ser Asn Gin Val lie Leu Pro Leu Ala Gin Thr 
515 520 525 



1584 



etc gaa cat gaa gag ttt etc gaa gtg gtt aga ctt gga ggt cat get 
Leu Glu His Glu Glu Phe Leu Glu Val Val Arg Leu Gly Gly His Ala 
530 - 535 540 



1632 



tac tea cct gaa gac atg ggc tta tec egg gat atg tat tta ctg cag 
Tyr Ser Pro Glu Asp Met Gly Leu Ser Arg Asp Met Tyr Leu Leu Gin 
545 550 555 560 



1680 



ctt tgt age ggc gtt gat gaa aat gtg gtt gga ggt tgt get cag ctt 
Leu Cys Ser Gly Val Asp Glu Asn Val Val Gly Gly Cys Ala Gin Leu 
565 570 575 



1728 



gtc ttt gee cca ate gat gaa tea ttt get gat gat gca cct ttg ctt 
Val Phe Ala Pro lie Asp Glu Ser Phe Ala Asp Asp Ala Pro Leu Leu 
580 585 590 



1776 



cct tct ggt ttc cgt gtc ata cca etc gac caa aaa aca aat ccg aat 
Pro Ser Gly Phe Arg Val lie Pro Leu Asp Gin Lys Thr Asn Pro Asn 
595 600 605 



1824 



gat cat caa tct gca agt cga aca egg gat eta gca teg tec eta gat 
Asp His Gin Ser Ala Ser Arg Thr Arg Asp Leu Ala Ser Ser Leu Asp 
610 615 620 



1872 



ggt tec acc aaa acc gat teg gaa aca aac tct aga ttg gtc tta aca 
Gly Ser Thr Lys Thr Asp Ser Glu Thr Asn Ser Arg Leu Val Leu Thr 
625 630 635 640 



1920 



ata gee ttc cag ttc acg ttt gat aac cat tec aga gac aat gtt get 
lie Ala Phe Gin Phe Thr Phe Asp Asn His Ser Arg Asp Asn Val Ala 
645 650 655 



1968 



aca atg gcg aga cag tat gtg agg aac gtt gtt ggt teg att cag aga 
Thr Met Ala Arg Gin Tyr Val Arg Asn Val Val Gly Ser He Gin Arg 
660 665 670 



2016 



gtg get eta gee att acg cct cgt cct ggc tea atg caa ctt ccc act 
Val Ala Leu Ala He Thr Pro Arg Pro Gly Ser Met Gin Leu Pro Thr 
675 680 685 



2064 



tec cct gaa get etc act ctt gtc cgt tgg ate acc cgt agt tac agt 
Ser Pro Glu Ala Leu Thr Leu Val Arg Trp He Thr Arg Ser Tyr Ser 
690 695 700 



2112 



att cat aca ggt gca gat ctg ttt gga get gat tct cag tec tgt gga 
lie His Thr Gly Ala Asp Leu Phe Gly Ala Asp Ser Gin Ser Cys Gly 
705 710 715 720 



2160 



gga gac aca ttg ctt aag caa etc tgg gac cat agt gat gec ata ttg 
Gly Asp Thr Leu Leu Lys Gin Leu Trp Asp His Ser Asp Ala He Leu 
725 730 735 



2208 



tgc tgc tec ctg aaa act aat gec tea ccg gta ttc aca ttt gca aac 
Cys Cys Ser Leu Lys Thr Asn Ala Ser Pro Val Phe Thr Phe Ala Asn 
740 745 750 



2256 



caa get ggt tta gac atg ctt gaa act aca ctt gtg gca ctt cag gat 
Gin Ala Gly Leu Asp Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp 
755 760 765 



2304 



ata atg etc gac aaa aca ctt gat gac tct ggt cgt aga get ctt tgc 
He Met Leu Asp Lys Thr Leu Asp Asp Ser Gly Arg Arg Ala Leu Cys 
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770 



MBI0018 Sequence Listing.ST25 
775 780 



tec aaa ttc qcc aag ate atg cag cag gga tat gcg aat ctt ccg gca 
Ser §lS Phe III lys He Me! Gin Gin Gly Tyr Ala Asn Leu Pro Ala 
785 790 

aaa ata tqt gtg teg age atg ggc aga ccg gtt teg tat gag caa gcg 
l!y Ue Cys Val Ser sir Met Gly Arg Pro Val Ser Tyr Glu Gin Ala 

acg gtg tgg aaa gtt gtt gat gac aac gaa tea aac cac tgc ttg get 
Thr Val Tr^ Lys Val Val Asp Asp Asn Glu Ser Asn His Cys Leu Ala 
820 825 

ttt ace etc gtt agt tgg teg ttt gtt tga 
Phe Thr Leu Val Ser Trp Ser Phe val 
835 



2400 



2448 



2496 



2526 



<210> 44 
<211> 841 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 44 

Met Met Ala His His Ser Met Asp Asp Arg Asp Ser Pro Asp Lys Gly 



10 



1 5 

Phe Asp Ser Gly Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala 
20 25 

Leu Glu Arg Val Tyr Ala Glu Cys Pro Lys Pro Ser Ser Leu Arg Arg 



35 



40 



Gin Gin Leu lie Arg Glu Cys Pro lie Leu Cys Asn He Glu Pro Arg 
50 55 60 

Gin lie Lys Val Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg 



65 



70 



Lys Glu Ser Ala Arg Leu Gin Thr Val Asn Arg Lys Leu Ser Ala Met 



85 



Asn Lys Leu Leu Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser 



100 



105 



Asn Leu Val Tyr Glu Asn Gly Phe Met Lys His Arg lie His Thr Ala 
115 120 

Ser Gly Thr Thr Thr Asp Asn Ser Cys Glu Ser Val Val Val Ser Gly 



130 



135 



Gin Gin Arg Gin Gin Gin Asn Pro Thr His Gin His Pro Gin Arg Asp 
145 150 155 

val Asn Asn Pro Ala Asn Leu Leu Ser lie Ala Glu Glu Thr Leu Ala 



165 



170 



Glu Phe Leu Cys Lye Ala Thr Gly Thr Ala Val Asp Trp Val Gin Met 
180 185 



lie Gly Met Lys Pro Gly Pro Asp Ser He Gly lie Val Ala Val 
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200 205 



Arg Asn Cys Ser Gly He Ala Ala Arg Ala Cys Gly Leu Val Ser Leu 
210 215 220 



Glu Pro Met Lys Val Ala Glu He Leu Lys Asp Arg Pro Ser Trp Phe 
225 230 235 240 



Arg Asp Cys Arg Cys Val Glu Thr Leu Asn Val He Pro Thr Gly Asn 
245 250 255 



Gly Gly Thr He Glu Leu Val Asn Thr Gin He Tyr Ala Pro Thr Thr 
260 265 270 



Leu Ala Ala Ala Arg Asp Phe Trp Thr Leu Arg Tyr Ser Thr Ser Leu 
275 280 285 



Glu Asp Gly Ser Tyr Val Val Cys Glu Arg Ser Leu Thr Ser Ala Thr 
290 295 300 



Gly Gly Pro Asn Gly Pro Leu Ser Ser Ser Phe Val Arg Ala Lys Met 
305 310 315 320 



Leu Ser Ser Gly Phe Leu He Arg Pro Cys Asp Gly Gly Gly Ser He 
325 330 335 



He His He Val Asp His Val Asp Leu Asp Val Ser Ser Val Pro Glu 
340 345 350 



Val Leu Arg Pro Leu Tyr Glu Ser Ser Lys He Leu Ala Gin Lys Met 
355 360 365 



Thr Val Ala Ala Leu Arg His Val Arg Gin He Ala Gin Glu Thr Ser 
370 375 380 



Gly Glu Val Gin Tyr Ser Gly Gly Arg Gin Pro Ala Val Leu Arg Thr 
385 390 395 400 



Phe Ser Gin Arg Leu Cys Arg Gly Phe Asn Asp Ala Val Asn Gly Phe 
405 ~ 410 415 



Vai Asp Asp Gly Trp Ser Pro Met Ser Ser Asp Gly Gly Glu Asp He 
420 425 430 



Thr He Met He Asn Ser Ser Ser Ala Lys Phe Ala Gly Ser Gin Tyr 
435 440 445 



Gly Ser Ser Phe Leu Pro Ser Phe Gly Ser Gly Val Leu Cys Ala Lys 
450 455 460 



Ala Ser Met Leu Leu Gin Asn Val Pro Pro Leu Val Leu He Arg Phe 
465 470 475 480 



Leu Arg Glu His Arg Ala Glu Trp Ala Asp Tyr Gly Val Asp Ala Tyr 



485 



490 



495 
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Ser Ma Ma Ser Leu Arg Ala -r Pro Tyr Ala ^ 

, T1 t^, Pro Leu Ala Gin Thr 
cor Asn Gin Val He Ijeu Pro 
Thr Gly Gly Phe Pro ser Asn 5 25 
515 

530 

noo Me>t Tvr Leu Leu Gin 
* M »r Glv Leu Ser Arg Asp Met iyr 56Q 
Tyr Ser Pro Glu Asp Met Gly 555 
545 

_ w . - - « - ~ - « s w ~ "* « " 

« « ai. B » « - - S - - - S " 

595 

o, e e r Ser Leu Asp 
Asp His Gin Set Ala « Jg » • « - » S ~ 
610 

* aso Ser Glu Thr Asn Ser Arg Leu val Leu Thr 
Gly Ser Thr Lys Thr Asp Ser g35 
625 

or Arq Asp Asn Val Ala 
ne Ala Phe Gin Phe Thr Phe Asp Asn Hx; Ser Ar 9 

64b 

acn val val Gly Ser He Gin Arg 
n *™ rln Tvr Val Arg Asn Val vai r 
Thr Met Ala Arg Gin iyr 665 
660 

~„ rlv Ser Met Gin Leu Pro Thr 
val Ala Leu Ala He Thr Pro Arg Pro Gly 

675 

, val Arg Trp He Thr Arg Ser Tyr Ser 
Glu Ala Leu Thr Leu val Arg Trp 7Q0 



690 

n . 81 . « „ ai. -j - ~ « - » s « - s " ws « 

705 

w w . - - - - - - a -•- M - " 

„ w . - «. - - «• s - ~ - ~ « ~ 

^ y 7 740 

«— - .««-.«- 

755 

o ^ riv Arq Arg Ala Leu Cys 
XX. Met Leu Asp Lys Thr Leu Asp Asp Ser Gly Arg 
770 

Tl. Met Gin Gin Gly Tyr Ala Asn Leu Pro Ala 
Ser Glu Phe Ala Lys lie Met Gin 
785 
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Gly He Cys Val Ser Ser Met Gly Arg Pro Val Ser Tyr Glu Gin Ala 
805 810 815 

Thr Val Trp Lys Val Val Asp Asp Asn Glu Ser Asn His Cys Leu Ala 
820 825 830 

Phe Thr Leu Val Ser Trp Ser Phe Val 
835 840 

<210> 45 

<211> 2511 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1)..(2511) 

<223> G1548 

<400> 45 

atg gca atg tct tgc aag gat ggt aag ttg gga tgt ttg gat aat ggg 48 
Met Ala Met Ser Cys Lys Asp Gly Lys Leu Gly Cys Leu Asp Asn Gly 
1 5 10 15 

aag tat gtg agg tat aca cct gaa caa gtt gaa gca ctt gag agg ctt 96 
Lys Tyr Val Arg Tyr Thr Pro Glu Gin Val Glu Ala Leu Glu Arg Leu 
20 25 30 

tat cat gac tgt cct aaa ccg agt tct att cgc cgt cag cag ttg ate 144 
Tyr His Asp Cys Pro Lys Pro Ser Ser He Arg Arg Gin Gin Leu He 
35 40 45 

aga gag tgt cct att etc tct aac att gag cct aaa cag ate aaa gtg 192 
Arg Glu Cys Pro lie Leu Ser Asn He Glu Pro Lys Gin He Lys Val 
50 55 60 

tgg ttt cag aac cga aga tgt aga gag aaa caa agg aaa gag get tea 240 
Trp Phe Gin Asn Arg Arg Cys Arg Glu Lys Gin Arg Lys -Glu Ala Ser 
65 70 75 80 

egg ctt caa. get gtg aat egg aag ttg acg gca atg aac aag etc ttg 288 
Arg Leu Gin Ala Val Asn Arg Lys Leu Thr Ala Met Asn Lys Leu Leu 
85 90 95 

atg gag gag aat gac agg ttg cag aag caa gtg tea cag ctg gtc cat 336 
Met Glu Glu Asn Asp Arg Leu Gin Lys Gin Val Ser Gin Leu Val His 
100 105 110 

gaa aac age tac ttc cgt caa cat act cca aat cct tea etc cca get 384 
Glu Asn Ser Tyr Phe Arg Gin His Thr Pro Asn Pro Ser Leu Pro Ala 
115 120 125 

aaa gac aca age tgt gaa teg gtg gtg acg agt ggt cag cac caa ttg 432 
Lys Asp Thr Ser Cys Glu Ser Val Val Thr Ser Gly Gin His Gin Leu 
130 135 140 

gca tct caa aat cct cag aga gat get agt cct gca gga ctt ttg tec 480 
Ala Ser Gin Asn Pro Gin Arg Asp Ala Ser Pro Ala Gly Leu Leu Ser 
145 150 155 160 

att gca gaa gaa act tta gca gag ttt ctt tea aag gca act gga ace 528 
He Ala Glu Glu Thr Leu Ala Glu Phe Leu Ser Lys Ala Thr Gly Thr 
165 170 175 

get gtt gag tgg gtt cag atg cct gga atg aag cct ggt ccg gat tec 576 
Ala Val Glu Trp Val Gin Met Pro Gly Met Lys Pro Gly Pro Asp Ser 
180 185 190 

att gga ate ate get att tct cat ggt tgc act ggt gtg gca gca cgc 624 
He Gly He He Ala He Ser His Gly Cys Thr Gly Val Ala Ala Arg 
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200 _ ... „ fc 672 



tc 

at etc gag ect aca agg gtt go. Vai 

-s s e s ta e i « ~ Tht Arg s atg 

tto cgc gaa tflt cga get gtt g» 9« * e ? 
« gaC S - SS S & & Si- cvs Arg Ma 
iy s Asp Arg Pro 23Q 

" 5 „ «. - - e- a a w s e e » a a - 

Gin 26Q 

Ar9 ser Leu I*. ^ 

Asn Phe val ft * 310 t ttg 

305 ^ a ta cac ata gtg fat cat atg a Leu 

B B s a e a e « 5 a 8 " w ' s s " " 

Olu Ala cy ctc aag 

Gin He Ala Gin Oiu 3?5 

1 * « - » a a a a a a = =■ a 

Gly Phe Asn 4Q5 tct cca 

tQ cat gat gtc aca ate act gta aac ^ ger pro 
\ll My S mS K U 116 ^ 

lie Gly P 420 ct cct 

gac aag eta atg % eta aat ctt aca ttt gee - & phe Ala Pro 

tt a tge Tea aaa gca tea atg ett «. g9 £ 
Val Ser Asn Val vaj. ^ 

val Pro Pro Ala 47Q 

465 nro tat eta gca gca gca gtt aaa 9^ 

« a a a a a a a $ ffi - u * *" * 

99g « Mt * «<* «tt »' f>s|e „ 



720 
768 
816 
864 
912 
960 
1008 
1056 
1104 
1152 
1200 



1248 
1296 
1344 
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1440 
1488 
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Gly Pro Cys Ser Ala Arg Val Gly Gly Phe Gly Gly Gin Val lie Leu 
500 505 510 

cca ctt get cat act att gag cat gaa gag ttt atg gaa gtc ate aaa 1584 
Pro Leu Ala His Thr He Glu His Glu Glu Phe Met Glu Val He Lys 
515 520 525 

ttg gaa ggt ctt ggt cat tec cct gaa gat gca ate gtt cca aga gat 1632 
Leu Glu Gly Leu Gly His Ser Pro Glu Asp Ala He Val Pro Arg Asp 
530 535 540 

ate ttc ctt ctt caa ctt tgt age gga atg gat gaa aat get gta gga 1680 
He Phe Leu Leu Gin Leu Cys Ser Gly Met Asp Glu Asn Ala Val Gly 
545 550 555 560 

ace tgt gcg gaa ctt ata ttt get cca ate gat get teg ttt gcg gat 1728 
Thr Cys Ala Glu Leu He Phe Ala Pro He Asp Ala Ser Phe Ala Asp 
565 570 575 

gat gca cct ctg ctt cct tct ggt ttt cgt att ate cct ctt gat tec 1776 
Asp Ala Pro Leu Leu Pro Ser Gly Phe Arg He He Pro Leu Asp Ser 
580 585 590 

gca aag gaa gta tct age cca aac cga ace ttg gat ctt get teg gca 1824 
Ala Lys Glu Val Ser Ser Pro Asn Arg Thr Leu Asp Leu Ala Ser Ala 
595 600 605 

ctg gaa att ggt tea get gga aca aaa gee tea act gat caa tea gga 1872 
Leu Glu He Gly Ser Ala Gly Thr Lys Ala Ser Thr Asp Gin Ser Gly 
610 615 620 

aac tec aca tgt gca aga tct gtg atg aca ata gca ttt gag ttt ggt 1920 
Asn Ser Thr Cys Ala Arg Ser Val Met Thr lie Ala Phe Glu Phe Gly 
625 630 635 640 

ate gag age cat atg caa gaa cat gta gca tec atg get agg cag tat 1968 
He Glu Ser His Met Gin Glu His Val Ala Ser Met Ala Arg Gin Tyr 
645 650 655 

gtt cga ggt ate ata tea teg gtg cag aga gta gca ttg get ctt tct 2016 
Val Arg Gly He He Ser Ser Val Gin Arg Val Ala Leu Ala Leu Ser 
660 665 670 

cct tct cat ate age tea caa gtt ggt eta cgc act cct ttg ggt act 2064 
Pro Ser His He Ser Ser Gin Val Gly Leu Arg Thr Pro Leu Gly Thr 
675 680 685 

cct gaa gec caa aca ctt get cgt tgg att tgc cag agt tac agg ggc 2112 
Pro Glu Ala Gin Thr Leu Ala Arg Trp He Cys Gin Ser Tyr Arg Gly 
690 695 700 

tac atg ggt gtt gag eta ctt aaa tea aac agt gac ggc aat gaa tct 2160 
Tyr Met Gly Val Glu Leu Leu Lys Ser Asn Ser Asp Gly Asn Glu Ser 
705 710 715 720 

att ctt aag aat ctt tgg cat cac act gat get ata ate tgc tgc tea 2208 
He Leu Lys Asn Leu Trp His His Thr Asp Ala He He Cys Cys Ser 
725 730 735 

atg aag gec ttg ccc gtc ttc aca ttt gca aac cag gcg gga ctt gac 2256 
Met Lys Ala Leu Pro Val Phe Thr Phe Ala Asn Gin Ala Gly Leu Asp 
740 745 750 

atg ctg gag act aca tta gtt get ctt caa gac ate tct tta gag aag 2304 
Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Ser Leu Glu Lys 
755 760 765 

ata ttt gat gac aat gga aga aag act ctt tgc tct gag ttc cca cag 2352 
He Phe Asp Asp Asn Gly Arg Lys Thr Leu Cys Ser Glu Phe Pro Gin 
770 775 780 

ate atg caa cag ggc ttc gcg tgc ctt caa ggc ggg ata tgt etc tea 24 00 
He Met Gin Gin Gly Phe Ala Cys Leu Gin Gly Gly He Cys Leu Ser 
785 790 795 800 
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MBI0018 sequence ^f^t'g^a gta 2448 

s a b a i e a s e § e «» - » " l 
s s k e e k £ si | - $ » « s 55 

820 2511 

tag tec ttt gtg tga 
TrS Ser Phe val 
* 835 

<210> 46 
<211> 836 

lllll Arabidopsis thaliana 

AS p Gly Lys Leu Gly cys Leu Asp Asn Gly 
Met Ala Met Ser Cys Lys Asp Gly Ly ^ 15 

L. w « « - - ~ « a- - ° w M ' - » MU 

c » tie Arg Arg Gin Gin Leu He . 
^ His Asp Cys Pro Lys Pro Ser Ser He Arg ^ 

' 35 

tt rlu Pro Lys Gin He Lys Val 
Arg Glu Cys Pro He Leu Ser Asn He Glu Y 
50 

„ « - « - - - - * a- " ws G1 " "* ~ 



«. „ s„ ~ ~ « « s ~ - - ~ s - " 

115 

, TKr cpr Glv Gin His Gin Leu 
Lys Asp Thr ser Cys Glu Ser Val val Tnr Ser ay 
130 

_ «. - - « « - - s " S - 011 - s 

145 

ne M . «. s - - - - s; - - M * - s ~ 

Gly Met Lys Pro Gly Pro Asp Ser 



Ala 



val Glu Trp val Gin Met Pro Giy ~* — 



180 



G1 xx. XX. -a He Ser His Giy Cys Tnr Gly Val Ala Ala Arg 
lie Gly He lie 200 
195 

Ala «, - V.. „ - - - « S "* ~ 
210 
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Lys Asp Arg Pro Ser Trp Phe Arg Glu Cys Arg Ala Val Glu Val Met 
225 230 235 240 

Asn Val Leu Pro Thr Ala Asn Gly Gly Thr Val Glu Leu Leu Tyr Met 
245 250 255 

Gin Leu Tyr Ala Pro Thr Thr Leu Ala Pro Pro Arg Asp Phe Trp Leu 
260 265 270 

Leu Arg Tyr Thr Ser Val Leu Glu Asp Gly Ser Leu Val Val Cys Glu 
275 280 285 

Arg Ser Leu Lys Ser Thr Gin Asn Gly Pro Ser Met Pro Leu Val Gin 
290 295 300 

Asn Phe Val Arg Ala Glu Met Leu Ser Ser Gly Tyr Leu lie Arg Pro 
305 " 310 315 320 

Cys Asp Gly Gly Gly Ser lie lie His He Val Asp His Met Asp Leu 
325 330 335 

Glu Ala Cys Ser Val Pro Glu Val Leu Arg Pro Leu Tyr Glu Ser Pro 
340 345 350 

Lys Val Leu Ala Gin Lys Thr Thr Met Ala Ala Leu Arg Gin Leu Lys 
355 * 360 365 

Gin He Ala Gin Glu Val Thr Gin Thr Asn Ser Ser Val Asn Gly Trp 
370 375 , 380 

Gly Arg Arg Pro Ala Ala Leu Arg Ala Leu Ser Gin Arg Leu Ser Arg 
385 390 395 400 

Gly Phe Asn Glu Ala Val Asn Gly Phe Thr Asp Glu Gly Trp Ser Val 
405 410 415 

He Gly Asp Ser Met Asp Asp Val Thr He Thr Val Asn Ser Ser Pro 
420 425 ' 430 

Asp Lys Leu Met Gly Leu Asn Leu Thr Phe Ala Asn Gly Phe Ala Pro 
435 440 445 

Val Ser Asn Val Val Leu Cys Ala Lys Ala Ser Met Leu Leu Gin Asn 
450 455 460 

Val Pro Pro Ala He Leu Leu Arg Phe Leu Arg Glu His Arg Ser Glu 
465 470 475 480 

Trp Ala Asp Asn Asn lie Asp Ala Tyr Leu Ala Ala Ala Val Lys Val 
485 490 495 

Gly Pro Cys Ser Ala Arg Val Gly Gly Phe Gly Gly Gin Val He Leu 
500 505 510 

Pro Leu Ala His Thr He Glu His Glu Glu Phe Met Glu Val He Lys 
515 520 525 
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Leu Glu Gly Leu Gly His Ser Pro Glu Asp Ala lie Val Pro Arg Asp 
530 535 

He Phe Leu Leu Gin Leu Cys Sef Gly Met Asp Glu Asn Ala Val Gly 
545 550 555 

Thr Cys Ala Glu Leu He Phe Ala Pro lie Asp Ala Ser Phe Ala Asp 

AS p Ala Pro Leu Leu Pro Ser Gly Phe Arg lie He Pro Leu Asp Ser 
580 585 

Ala Lys Glu val Ser Ser Pro Asn Arg Thr Leu Asp Leu Ala Ser Ala 
595 600 

Leu Glu lie Gly Ser Ala Gly Thr Lys Ala Ser Thr Asp Gin Ser Gly 
610 615 

Asn ser Thr Cys Ala Arg Ser Val Met Thr lie Ala Phe Glu Phe Gly 
G25 630 " 5 

lie Glu Ser His Met Gin Glu His Val Ala Ser Met Ala Arg Gin Tyr 

Val Arg Gly He He Ser Ser Val Gin Arg Val Ala Leu Ala. Leu Ser 
660 665 

Pro Ser His He Ser Ser Gin Val Gly Leu Arg Thr Pro Leu Gly Thr 
675 680 

Pro Glu Ala Gin Thr Leu Ala Arg Trp He Cys Gin Ser Tyr Arg Gly 
690 695 

Tyr Met Gly Val Glu Leu Leu Lys Ser Asn Ser Asp Gly Asn Glu Ser 
705 710 

He Leu Lys Asn Leu Trp His His Thr Asp Ala He He Cys Cys Ser 
725 730 

„et Lys Ala Leu Pro Val Phe Thr Phe Ala Asn Gin Ala Gly Leu Asp 



740 

Met Leu Glu Thr Thr Leu Val Ala Leu Gin Asp He Ser Leu Glu Lys 



755 



lie Phe Asp Asp Asn Gly Arg Lys Thr Leu Cys Ser Glu Phe Pro Gin 
770 775 

lie Met Gin Gin Gly Phe Ala Cys Leu Gin Gly Gly He Cys Leu Ser 
785 790 

S er Met Gly Arg Pro Val Ser Tyr Glu Arg Ala Val Ala Trp Lys Val 



805 



Le u Asn Glu Glu Glu Asn Ala His Cys He Cys Phe Val Phe He Asn 
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